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1.0 Introduction 

An assembler is a translator whose source language is assembly language 
and whose object code is actual machine language. Assembly language is mostly 
a one-for-one representation of machine language written in a symbolic form. 
Its value comes from being easier to read and from the facilities provided by 
the assembler for doing calculations at assembly time . These range from simple 
address calculations to complex conditional assemblies in which totally 
different object programs may be generated, with the choice among them 
depending on the values of a few parameters. 

This section serves to define the terminology used. It is assumed that 
the programmer is familiar with the basic characteristics of the SDS 9^0 . 

1.1 Basic Description of the Assembler 

The assembler is a two-pass assembler with subprogram, literal, 
macro, and conditional assembly capabilities. 

1.2 Symbols 

Numbers may be represented symbolically in assembly language by 
symbols . A symbol is any string of letters and digits not forming a 
constant. (Constants are defined in Section *4-.2). In particular, it 
is not necessary that a symbol begin with a letter. Although symbols 
as written may be arbitrarily long, only the first six characters of a 
symbol are used to distinguish it from others. When a symbol is used to 
represent a memory address, it is called a label . Examples of symbols 
are: 

START Z1C A12 CALCULATE 



* Ref . to SDS 9k0 Computer Reference Manual, No. 90 06 UOA, August, 1966. 
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1-3 Instructions, Directives, and Comments 

Input to the assembler takes the form of a sequence of statements 
called instructions , directives , or comments . Instructions are symbolic 
representations of machine commands and are translated by the assembler 
into machine language. Directives, by contrast, are messages which serve 
to control the assembly process or create data. They may or may not 
generate output. Comments are ignored by the assembler, and serve only 
to clarify the meaning of a program. 
l.k Subprograms 

Programs often become quite large or fall into logical divisions 
which are almost independent. In either case it is convenient to break 
them into pieces and assemble (and even debug) them separately. Separately 
assembled parts of the same program are called subprograms . 

Before a program assembled in pieces as subprograms can be run it is 
necessary to load the pieces into memory and link them. The symbols used 
in a given subprogram are generally local to that subprogram. Subprograms 
do, however, need to refer to symbols defined in other subprograms. The 
linking process takes care of such cross references. Symbols used for it 
are called external symbols . 

1.5 Literals 

Often data is placed in programs at assembly time. It is frequently 
convenient to refer to constants by value than by label. A literal is a 
symbolic reference to a datum by value. The assembler allows any type of 
expression to be used as a literal. Some examples of literals are: 
=5 =3*XYZ-2 ='END ! =EXTERN 

1.6 Relocation 

A relocatable program is one in which memory locations have been 
computed relative to the first word or origin of the program. A loader 
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(for this asser;foler, DDT) can then place the assembled program into 
core "beginning at whatever location may be specified at load time. 
Placement of the program involves a small calculation. For example, 
if a memory reference is to the nth word of a program, and if the program 
is loaded beginning at location k, the loader must transform the reference 
into absolute location n+k. 

This calculation should hot be done to each word of a program since 
some machine instructions (shifts, for example) do not refer to memory 
locations. It is therefore necessary to inform the loader whether or not 
to relocate the address for each word of the program. Relocation infor- 
mation is determined automatically by the assembler and transmitted to 
the loader as a binary quantity called the relocation value . If R = 1 
the operand is to be relocated; if R = the operand is absolute. 

Constants or data may similarly require relocation, the difference 
here being that the relocation calculation should apply to all 2k bits of the 
9I4-O word, not just to the address field. The assembler accounts for this 
difference automatically. 

It is possible to disable relocation in the assembler and to do 
absolute assembly . In this event there is an option which produces a 
paper tape which can be loaded using the 9^0 fill switch. 
1.7 Basic Assembly Procedure 

During pass 1 of the two-pass process the operands of instructions and 
some directives are scanned for the presence of single symbols. If a single 
symbol is present, a table of symbols is searched. If absent, the symbol is 
added to the table but marked as not yet defined, i.e., having no value. 
Labels are placed into the symbol table in similar fashion, except that 
they are assigned the current value of the location counter , a word within 
the assembler which contains the relative address of the instruction. If 
a label has been previously defined, it is marked as a duplicate symbol 
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(this is taken to be an error). 

At the end of pass 1 the symbol table is sorted. All symbols present 
hdving no value are assumed to be external. These symbols are then output 
by the assembler for later use by the loader. During pass 2 the labels 
are not computed; rather, the operand fields of instructions and directives 
are evaluated using the now known symbol values. 

In absolute assemblies the scan for single symbols in pass 1 is 
disabled. This has the effect of doing away with external symbols. 
1.8 Notation 

In the following pages, square brackets [ ] are used to indicate the 
presence of optional quantities. 
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2.0 The Assembly Language 

2.1 Character Set 

The classes of characters recognized by the assembler are as follows: 

(a) digits 

(1) octal 0-7 

(2) decimal 0-9 

(b) letters A-Z 

(c) alphanumerics 0-9 and A-Z 

(d> delimiters + - * / , » () = .$ blank <- 

(e) special characters : ; < > ? [ ] " 
Note that the characters ! # % & <§> \ t which are normally found on standard 
Teletypes are not recognized by the assembler. Use of them in a program 
will result in their being replaced by blanks. 

2.2 Statements 

Statements are logical units of input. They may be delimited either 

by being placed on separate lines or by being separated with semi-colons. 

Semi-colons do not serve as statement delimiters when used between single 

quotes (as in the TEXT directive) or inside of matched parentheses (as in 

arguments of macro calls). Examples of statements are 

START LDA DAT21 
MLTL 213 
STA ANSWER 

or 

START LDA DAT21; MUL 233; STA ANSWER 

If a statement requires more than one line for any reason, it can be 

continued on the next line by typing a + in the first column of the next line 

Thus: 

START LDA DAT21; MUL 213; STA ANSWER THE C1QM 
•4MENT ON THIS LINE REQUIRES A CONTINUATION 

This kind of continuation may be done for about five lines (320 characters). 
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Each non-blank statement is an instruction, a directive, or a 
comment. Blank statements are ignored. Comments "begin with an asterisk; 

they have absolutely no effect on the program being assembled and serve 

only as annotations to clarify the meaning of the assembly language. 

Directives and instructions are divided into four fields. The 
fields are, from left to right, the label field, the operation field, the 
operand field, and the comment field. The assembler is a free- form 
assembler; its various fields are delimited by blanks rather than 
restricting them to fixed places in a line. This is explained in more 
detail below. 

The label field is used mostly for symbol definitions. It begins 
with the first character in the statement and ends on the first non- 
alphanumeric character. (The blank is usually the only legal terminator.) 
Thus,, in the following statements the symbol XYZ appears in label fields. 

XYZ LDA =10 
STA DEFjXYZ LDA =10; LDB* LMN 

The operation field contains (usually) a symbolic operation code or 
directive name. It begins with the first non-blank character after the 
termination of the label field. In the statements above, each operation 
field begins in a different position. Like the label field, the operation 
field terminates on the first non- alphanumeric character. Legal 
terminators are the blank, asterisk, semi-colon, and carriage return. 

The operand and comment fields each begin with the first non-blank 
character after the termination of the preceding field. The operand 
field terminates on the first blank or semi-colon not between matched 
single quotes or parentheses. The carriage return always terminates the 
field (and the statement). The comment field terminates on a semi-colon 
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or carriage return. This field, like the comment statement, is not used 
by the assembler; it may contain anything. 

2.3 Programs 

A program consists of a sequence of statements terminated by an END 

directive. Normally programs are assembled in relocatable form. A 

program is assembled in absolute self-loading- form if it begins with an 

ORG directive. It is possible (by using RELORG) to make an absolute 

assembly to be loaded by DDT. 
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3*0 The Syntax of Instructions 
3-1 Their Classification 

(a) Class 1 (normal instructions). 

Class 1 instructions in general use the operand field. Its 
absence implies the value zero. It is possible to specify for each 
Class 1 instruction whether or not the operand field must he present. 
It is also possible to specify that bit of the instruction word is 
to be set to one (as in SYSPOPs). There are two types of Class 1 
instructions: 

(1) type 

Ik 
The address is formed mod 2 . All instructions 

making memory references are of this type. 

(2) type 1 

9 
The operand is formed mod 2 . This type is used for 

shift instructions. If indirect addressing is used with 

lif 
this type, the address is formed mod 2 

Class 1 instructions have the following form: 

[[$] label] opcode [*] [operand[,tag]] [comment] 

Indirect addressing is signified by an asterisk immediately 

following the operation code or by preceding the operand with <- . 

The use of the dollar sign is explained in 3*2 The tag is used 

to specify bits 0, 1 and 2 of the 9^0 instruction word. 

(b) Class 2 (complete or full word instructions). 

Class 2 instructions have no operand field. Indirect addressing 
is signified by an asterisk immediately following the operation 
code. Class 2 instructions have the following form: 
[[$] label] opcode [*] [comment] 
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(c) Numeric op codes. 

Operation codes may "be specified as decimal or octal numbers, 
as for example: 

[[$] label] 76BO] [operand^ tag]] [comment] 

The assembler shifts the numeric op code (modulo 177o) left to 
the correct position in the instruction word. In such cases, the 
op code is assumed to be Class 1, type 0, no operand required, 
and with bit not set. 
3-2 Use of the Label Field 

A label identifies the instruction or data word being generated. The 
symbol used in the label field is given the current value of the location 
counter. Instructions will have labels normally if they are referred to 
elsewhere in the program, although it is not necessary that symbols defined 
in this way be used in references. Symbols defined but not used are called 
nulls ; they are marked as such in the assembly listing and explicitly 
typed out at the end of an assembly. 

If the same symbol appears in the label field of more than one 
instruction, it is marked as a duplicate and given the newer value. 

A $ preceding a label causes an external symbol definition (cf. 6.6). 
3.3 Operand Field 

The operand field contains at most two arithmetic expressions (or a 

literal and one expression) used to determine the operand and tag of the 

3 
machine command. The tag, if present, is evaluated mod 2 and must be 

absolute (i.e. n on- relocatable) . 

3.1+ Alternate Conventions for Expressing Indexed & Indirect Addresses 

It is possible to express both the use of indexing and indirect 

addressing in an alternative manner. In each case a special character 
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is placed at the beginning of the operand field. These characters are / 
for indexing and <- for indirect addressing. Thus, for example, 

LDA VECTOR, 2 is the same as LDA /VECTOR 
and 

STA* POINTR is the same as STA *-POINTR 
Similarly, 

LDA* C0MFLX,2 may be written either as 
LDA /f-COMPLX 
or LDA ^/COMPLX v 
Anything normally useful may follow the initial <- or /, for example 

LDA — =CHAIN (LDA* =CHAIN) 

This alternate way of expressing indexing and indirect addressing 
may be used by programmers as they choose. It was devised to iimplify 
the indication of these operations in the use of macros (see chapter 7)« 

3 «5 Comment Field 

The comment field is not processed by the assembler, but is copied 
to the assembly listing. 
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k.O Expression Syntax 



The assembler evaluates expressions as 2^-bit, signed integers. Expressions 

consist of constants and symbols connected by operators. Examples of expressions 

are: 

100- 2*ABC (OR )DEF/27B 

22 

C12>D19 

Expressions are evaluated from left to right, some operators taking precedence 

over others. As an expression is evaluated, a parallel calculation of its 

relocation value R is made. Only absolute expressions (R = 0) and relocatable 

expressions (R = l) are legal (cf. k.7)> 

k.l Operators 

The operators recognized by the assembler and their precedence are 

given below. Operators of highest precedence are applied first in 

evaluation of expressions. 



Operator Precedence 



(a) unary 



(b) relational 



(c) binary 



- 






h 




(not) 






k 




(R) 






k 


(cf. k.7) 


(lss) 


or 


< 


3 




(grt) 


or 


> 


3 




(EQU) 


or 


=: 


3 




* 






2 




/ 






2 




(and) 






2 




+ 






1 




- 






1 




(OR) 

(eor) 






1 
1 
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Note that some operators are more than one character long. These 
are enclosed in parentheses to avoid confusion with symbols which would 
otherwise look the same. Parentheses are therefore not allowed in 
expressions to delineate terms and modify the order of evaluation. 

The relational operators give rise to a value 1 if the relation is 
true and if false. There may "be only one relational operator in an 
expression . 
h.2 Constants 

Constants are of three types: . 

(a) decimal integers: one or more decimal characters possibly 
terminated with the letter D. 

2129, 600D, -217 

(b) octal integers: one or more octal characters possibly terminated 
with the letter B and optionally a single -digit octal scaling 
factor. 

217, 32B, *fB3 (which is the same as IfOOOg) 

(c) string: ' 1-lf characters (except ')' 

All constants are absolute, i.e., their relocation value is 0. 

The assembler normally expects integers to be decimal. This can 
be changed, however, by using a directive (OCT or DEC). In any case, 
integers may be terminated with B or D, overriding the normal inter- 
pretation of integers. String constants are not normally useful in the 
direct computation of memory addresses, but exist basically to be used 
in literals (cf. 5-0)* 
If. 3 Classification of Symbols 

The assembler recognizes the following types of symbols: 

(a) local symbols: These symbols are defined by their use in the 
label field of instructions and in some directives. Their 



R-26 
>+-3 



value is that of the location counter at their definition. They 
are thus symbolic addresses of memory cells. These symbols are 
relocatable (R = l) if the assembly is relocatable; if the 
assembly is absolute, they are absolute. Once having been 
defined, a local symbol may not be redefined. Attempts to do so 
are considered errors, and diagnostics result. 

(b) equated symbols: Equated symbols may be defined by equating 
them to an expression (using directives EQU, NARG, or NCHR). 
Their relocation value will be that of the expression. Unlike 
local symbols, equated symbols may be given new values at any 
point in the program. 

(c) current location counter symbol (■*) : The character *, if used 
in the proper context, is understood to mean the current value 
of the location counter. It is relocatable or absolute 
depending on the nature of the assembly. 

(d) external symbols: External symbols are those which are used 
but not defined in a given subprogram. They can be assigned 
no value, and it is not reasonable to regard them either as 
absolute or relocatable. External symbols may be used only as 
the sole object in an expression; other than its appearance as 
a sole object, the external symbol may not be used in an 
expression. 

Ij-.U Terms 

Terms are either constants or symbols, optionally preceded by a unary 
operator. The unary operator serves to modify both the value of the term 
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and its relocation value. One unary operator — special relocation, (r) — 
may set the relocation value of a term to any value. This feature is 
explained in much more detail in k.J- 
h.5 Expressions 

Expressions may consist of one or more terms connected "by binary operators, 
or they may he just a single external symbol. Their evaluation proceeds 
from left to right using operators of decreasing precedence. For example, 
let A = 100, B = 200, and C = -1. Then 

A+B*C/A =98 
Again, letting A = 5^321 g , B = W^g, and C = 00077o, then 

a(or)b(and)c = 5^3653 

^.6 Constraints of Relocatability of Expressions 

The implementation of the assembler forces the following constraints 
on the use of expressions: 

(a) No relocatable term (R = l) may occur in conjunction with the 
operators * or /. In other words, no relocatable symbol may 
multiply, be multiplied by, divide, or be divided by anything. 

(b) In the absence of the special relocation operator (r) the 
final relocation value of an expression may be only or 1. 
It is possible that the relocation value may attain other 
values in the course of evaluation. 

(c) If the special relocation operator (r) appears in an expression, 
then the relocation value of the expression may be either or 
some other value K, where K is the special relocation radix. DDT 
is informed by the assembler that special relocation is being used 
in this case. DDT will then multiply the base address by K 
before adding it to the value of the expression (see next section). 
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k.f Special Relocation 



The special relocation feature has "been provided to permit the 
programmer limited use of expressions which are not absolute or singly 
relocatable. To see why this is desirable, and how it works, consider 
the process of assembling and loading a relocatable program. Let the 
symbol A have value a. If one writes 

LDA A 
the assembler produces 

076 a 
and marks the instruction's address as being relocatable. Later when 
told to load the program beginning at base address b, DDT will form 

076 a+b 
Thus no matter where the program is loaded, the memory reference will be to 
the ath word from the base address. 
Now suppose one writes 

LDA 2*A 
The assembler, of course, can form 

076 2*a 
and presumably what DDT should form is 

O76 2*a+2*b = 076 2* (a+b) 
To do this, it must be told that b is to be multiplied specifically by 2. 
Only one bit is reserved, however, for such information in the assembler's 
binary output; it is this fact which causes the restriction that 
expressions may have only the relocation values and 1. And this 
restriction can be gotten around (inelegantly) by the use of (r) . 
The following example give's one of the main reasons for which (r) was. 
put into the assembler. 
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Programs may make use of the string-handling SYSPOPs of the 9k0. 
These instructions use string pointers , two-word objects containing 
starting and ending character addresses . Now characters are packed 
three per word. A character address therefore consists of the memory 
address containing the character multiplied "by 3 plus 0, 1, or 2 
depending on the position of the character in the word. If a character 
address is divided by 3> the quotient gives the word address and the 
remainder the character position in the word. 

To form a character address at assembly time, one must be able to 
multiply a word address (a relocatable item) by a constant (in this 
case, 3)« This is the reason for special relocation. The statement 

DATA (R)A+1 
will produce the value 

3*a+l 
together with a notation to DDT that special relocation applies to that 
value . 

DDT will then form the value 
(3*a+l)+3*b = 3*(a+b)+l 
symbol, representing a relocatable word address, may thus be used to form 
character addresses in string pointers. There are other examples for the 
need for special relocation, but they will not be mentioned here. Let it 
suffice to say that special relocation is merely a device to make up 
partially for the rather severe relocation constraints the assembler 
imposes upon programmers . 

It should be pointed out that the multiplicative constant associated 
with (r) in the example above was 3 because of the nature of string 
pointers. This constant is called the special relocation radix . It need 
not be 3 always. In fact, it may be changed to any value by the directive 
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RAD. Because of the relative importance of string pointers, however, 
the assembler is initialized with this value set to 3; it is hence 
unnecessary to use RAD to set it to 3 unless it has been changed for 
some reason. 
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5 .0 Literals 

Programmers frequently write such things as 
LDA FIVE 
where FIVE is the name of a cell containing the constant 5. The programmer 
must remember to include the datum FIVE in his program somewhere. This can 
he avoided by the use of a literal. 

LDA =5 
will produce automatically a location containing the correct constant in the 
program. Such a construct is called a literal. 
Literals are of the form 

=expression 
When encountering a literal, the assembler first evaluates the expression and 
looks up its value in a table of literals constructed for each subprogram. 
If it is not found in the table, the value is placed there. In any case the 
literal itself if replaced by the location of its value in the literal table. 
At the end of assembly the literal table is placed after the sub-program. 
The following are examples of literals: 

=10 =hB6 =ABC*20-DEF/l2 ='HELP'- 



=2=AB (This is a conditional literal. Its value will be 1 or 
depending on whether 2=AB at assembly time.) 

Some programmers tend to forget that the literal table follows the 

subprogram. This could be harmful if the program ended with the declaration 

of a large array using the statement 

ARRAY BSS 1 

It is not strictly correct to do this, but some programmers attempt it anyway 

on the -theory that all they want to do is to name the first cell of the array. 

The above statement will do that, of course, but only one cell will be reserved 

for the array. If any literals were used in the subprogram, they would be 
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placed in the following cells which now fall into the array. This is, of 
course, an error. Other than the above exception, the programmer need not 
concern himself with the locations of the literal values. 



R-26 
6-1 



6 . Directives 

There is a large number of directives associated with this assembler. 
Although many of the directives are similar, each in general has its own 
syntax. A concise summary is given "below: 

Class Directive Use/Function 



Data Generation: 



Value Declaration: 



Assembler Control: 



Output & Listing 

Control: 



Macro Generation 
& Conditional 
Assembly: 



COPY 
DATA 
TEXT 
ASC 

EQU 

EXT 

NARG 

NCHR 

OFD 

POPD 

BES 

BSS 

ORG 

END 

DEC 

OCT 

RAD 

FRGT 

IDENT 

DELSYM 

RELORG 

RETREL 

FREEZE 

NOEXT 

LIST 
NOLIST 
PAGE 
REM 

MACRO 

ENDM 

RPT 

CRPT 

ENDR 

IF 

ELSF 

ELSE 

ENDF 



Facilitates use of RCH command 
Generation of data 
Generation of text 
Generation of text 

Setting or changing symbol values 

Defining external symbols 

See 

See 

Defining new op codes 

Defining pox> codes 

Block ending symbol 

Block starting symbol 

Origin: absolute assembly 

End of program 

Interpret integers as decimal 

Interpret integers as octal 

Set special relocation radix 

Forget name of symbol 

Identify name of program 

Do not transmit symbols to loader 

See 6.21 

See 6.22 

Preserve symbols and macros 

Do not create external symbols 

Set listing flags 

Reset listing flags 

Skip to new page on listing 

Type out remarks in pass 2 

Head of macro body 

End of macro body 

Begin repeat body 

Begin conditional repeat body 

End repeat body 

Begin if body 

Alternative if body 

Alternative if body 

End of if body 
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6.1 COPY Generalized Register Change Command 

[[$] label] COPY s^s^s ,... [comment] 

•where s. are symbols from a special 
set associated with the COPY directive 

The COPY directive produces an RCH instruction. It takes in its operand 

field a series of special symbols, each standing for a bit in the address 

field of the instruction. The bits selected by a given choice of symbols 

are merged together to form the address. For example, instead of using 

the instruction CAB (0U60000^), one could write COPY AB. The special 

symbol AB has the value OOOOOOOU. 

The advantage of the directive is that unusual combinations of bits 

in the address field — those for which there exist normally no operation 

codes -- may be created quite naturally. The special symbols are mnemonics 

for the functions of the various bits. Moreover, these symbols have this 

special meaning only when used with this directive; there is no restriction 

on their use either as symbols or op codes elsewhere in a program. The 

symbols are: 

Symbol Bit Function 

Clear A 

Clear B 

Copy (A) -» B 

Copy (B) -» A ' 

Copy (B) ->X 

Copy (x) -*B 

Bits 15-23 (exponent part) only 

Copy (X) ->A 

Copy (A) ->X 

Copy -(A) -> A (negate A) 

Clear X 

To exchange the contents of the B and X registers, negate A, and only 

for bits 15-23 of all registers, one would write 

COPY BX,XB,N,E 



A 


23 


B 


22 


AB 


21 


BA 


20 


BX 


19 


XB 


18 


E 


17 


XA 


16 


AX 


15 


N 


ik 


X 


2 
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Of course, the symbols may be written in any order. 

Clever programmers please note: This directive facilitates nicely 
some special RCH functions which might not otherwise be attempted (it 
is usually too much trouble). For example, 

COPY AX,BX 
has the effect of loading into X the logical OR (merging) of the A and B 
registers. Interested readers are referred to the SDS 9^0 manual for more 
details of the RCH instruction. 

6.2 DATA Generate Data 

[[$]label] DATA e-,e ,e ,. .. [comment] 
The DATA directive is used to produce data in programs. Each expression 
in the operand field is evaluated and the 2*4— bit values assigned to 
increasing memory locations. One or more expressions may be present. 
The label is assigned to the location of the first expression. The effect 
of this directive is to create a list of data, the first word of which may 
be labeled. 

Since the expressions are not restricted in any way, any type of 
data can be created with this directive. For example: 
DATA 100, - 217B, START, AB*2/DEF, ' NUTS ' , 5 

6.3 TEXT Generate Text 

[[$] label] TEXT 'text' [comment] 
or, 

[[$]label] TEXT expression, text [comment] 
The TEXT directive is used to create a string of 6-bit trimmed ASCII 
characters, packed four to a word and assigned to increasing memory 
locations. The first word of the string may be labeled. The string to be 
packed may be delineated either by enclosing it in quotes (as in the first 
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case above) or by preceding it with a word count (as in the second case). 

The second form of the directive must be used, of course, if the string 

contains one or more quotes. A potential hazard arising here should be 

pointed out. If a statement contains a single quote (or any odd number 

of them), it will not terminate with a semi-colon; a carriage return must 

be used. 

TEXT *4-,THIS WON'T WORK; TEXT k, DISASTER AHEAD 

In the line above the semi-colon will be part of the text, and the second 

statement will be interpreted as being in the comment field, 

TEXT U,THIS WILL ' 
TEXT 1,A-0K 

In the first form of the directive, characters in the last word are 

left- justified and remaining positions filled in by blanks (octal 00). 

In the second form, sufficient characters are packed to satisfy the word 

count. 

6.^- ASC Generate Text with Three Characters per Word 

This directive is identical in form and use to TEXT, except that 
8-bit characters are packed three per word . The 9^0 string processing 
system normally deals with such text. 

6.5 EQU Equals 

[$] symbol EQU expression [comment] 
The EQU directive causes the symbol in its label field to be defined 
and/or given the value of the expression. The expression must have a 
value when EQU is first encountered; i.e., symbols present in it must have 
been previously defined. It is permissible to redefine by EQU any symbol 
previously defined by EQU (or NARG or NCHR, cf . below). This ability is 
particularly useful in macros and conditional assembly. 
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6.6 EXT Define External Symbol 

There are four ways which may be used to define external symbols. 

(a) $label opcode or directive operand, etc. 

The $ preceding the label causes the symbol in the label field 
to be defined externally at the same time it is defined locally. 

(b) symbol EXT (comment not permitted) 

The symbol given in the label field is defined externally. 

This symbol must have been defined previously in the program. 

The operand and comment fields must be absent. 
Both of the above forms have the same effect; the name and value of a local 
symbol is given to the loader for external purposes. 

Occasionally it is desirable to define an external symbol whose name 
is different from that of a local symbol; or an external symbol may be 
defined in terms of an expression involving local symbols. There are 
two ways of doing this. 

(c) $symbol EQ.U expression [comment] 

(d) symbol EXT expression [comment] 

In (c) above the symbol is defined both locally and externally at the same 
time. (d) differs subtly in that the symbol in the label field is defined 
only externally; its name and value are completely unknown to the local 
program. 

The feature (d) above is particularly useful in situations where two or 
more subprograms loaded together have name conflicts . For example, suppose 
programs A and B both make use of the symbol START, and A not only refers 
to its own START but B's as well. The latter references can be changed to 
BEGIN. Then into program B can be inserted the line 

BEGIN EXT START 
No other changes need be made either to A or B. 
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Occasionally, after having written a program, one would like to make 

a list of local symbols to be externally defined. A built-in macro ENTRY 

serves this function. That it is a built-in macro is irrelevant; the 

programmer may think of it as a related directive. Thus 

ENTRY A,B,C,D, ... 

is precisely equivalent to 

A EXT . 

B EXT 

C EXT 

D EXT 



6.7 NARG Equate Symbol to Number of Arguments in Macro Call 

[$] symbol NARG [comment] 
This directive may be used only in macro definitions. It is mentioned 
here only for completeness. It operates exactly as EQU except that in 
place of an expression in the operand field, the value of the symbol is 
set to the number of arguments used in calling the macro currently being 
expanded. Cf . 7.9 below. 

6.8 NCHR Equate Symbol to the Number of Characters in Operand 

[$] symbol NCHR operand [comment] 
This directive is intended for use mostly in macro definitions, but it 
may be used elsewhere. It operates exactly as EQU except that in place 
of an expression in the operand field, the value of the symbol is set to 
the number of characters included in the operand field. A further 
explanation of the utility of this directive is deferred to section 7- 



R-26 
6-7 



6.9 OPD Operation Code Definition 

The OFD directive gives the programmer the facility to add to the 
existing table of operation codes kept in the assembler new codes or to 
change the equivalences of current ones. The form of OFD is: 

opcode OFD expression, class [ , ar[,type[, sb] ]] [comment] 
where: l) class must be 1 or 2 (cf. Section 3«l)« 

2) ar (address required) may be or 1 

3) type may be or 1 (cf. Section 3«l)» 
h) sb (sign bit) may be or 1 

Quantities governed by the optional terms above (2,3 and k) are set to 
zero if the terms are missing. As examples of how the directive is used, 
some standard machine instructions are defined as follows: 

CLA OFD 046000013, 2 

LDA OFD 76B5,1,1 

RCY 0PD 662B4, 1,1,1 (TYPE 1 = SHIFT) 
A hypothetical SYSPOP LLA might be defined by 

LLA OFD 110B5, 1^1^0,1 
(class 1, address required, type 0, sign bit set). 

In operation, the assembler simply adds new op codes defined by OPD 
to its opcode table. This table is always searched backward, so the new 
codes are seen first. At the beginning of the second pass the original 
table boundary is reset; thus if an opcode is redefined somewhere during 
assembly, it is treated identically in both passes. 

6.10 POFD Programmed Operator Definition 

In programs containing POPs it is desirable to provide the POFD 
directive. This directive works exactly like OPD and is used in the same 
way. Its essential difference from OPD is that it places automatically 
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in the POP transfer vector (IOOq - ^-T7o) a branch instruction to the body 
of the POP routine. 

In order to do this the assembler must know two things: 

(1) the location for the branch instruction in the transfer vector and 

(2) the location of the POP routine (i.e. the address of the branch 
instruction). 

Item (l) is given by the POP code itself. Item (2) is provided by the 
convention that the POPD must immediately precede the body of the POP 
routine. The address of the branch instruction placed in the transfer 
vector is the current value of the location counter. 

If the automatic insertion of a word in the POP transfer vector is 
not desired, then OPD should be used instead. An example of this case 
would occur in a subprogram containing a POP whose routine is found in 
another subprogram. 
6.11 BES Block Ending Symbol 

[[$]label] BES expression [comment] 
The use of BES reserves a block of storage for which the first location 
after the block may be labeled (i.e. if the label is given). The block 
size is determined by the value of the expression; it must therefore be 
absolute, and it must have a value when BES is first encountered, (symbols 
present must have been previously defined). BES is most useful for 
labeling a block which is to be referred to by indexing using the BRS 
instruction (where the contents of X are usually negative). For example, 
to add together the contents of an array one might write: 

LDX =-100 ARRAY HAS 100 ENTRIES 

CLA 

LOOP ADD ARRAY, 2 NEGATIVE mDEXING HERE 

BRX *-l 

STA RESULT 

HLT 

ARRAY BES 100 
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6.12 BSS Block .Starting Symbol 

[[$]label] BSS expression [comment] 
The use of BSS reserves a block of storage for which the first word may- 
be labeled (if the label is given). The block size is determined by the 
value of the expression; it must therefore be absolute, and it must have 
a value when BSS is first encountered. The difference between BSS and BES 
is that in the case of BSS the first word of the block is labeled, whereas 
for BES the first word after the block is labeled by the associated symbol. 
BSS is most useful for labeling a block which is referred to by positive 
indexing (cf. 6.11 above). 

6.13 ORG Program Origin 

ORG expression [comments] 
The use of ORG forces an absolute assembly. The location counter is 
initialized to the value of the expression. The expression must therefore 
be absolute, and it must have a value when ORG is first encountered. 
An ORG must precede the first instruction or data item in an absolute 
program, although it does not necessarily. have to be the first statement. 
The output of the assembler will have a bootstrap loader at the front 
which is capable of loading the program after initiation by the 9^-0 
FILL switch. 

6.1*1 END End of Assembly 
END [expression] 
The END directive terminates the assembly. For relocatable assemblies, 
no expression is used. For absolute assemblies the expression gives the 
starting location for the program. When assembling in absolute mode, 
the assembler produces a paper tape which can be read into the machine 
with the FILL switch, i. e., out of the time-sharing mode. If the 
expression is not included with the END directive, the bootstrap loader 
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on this paper tape will h^lt after the tape has read in. Otherwise control 
will automatically transfer to the location designated in the expression. 

6.15 DEC Interpret Integers as Decimal 

DEC [comments] 
Integers terminated with B or D are always interpreted respectively as 
"being octal or decimal. On the other hand, integers not terminated with 
these letters may "be interpreted either as decimal or octal depending on 
the setting of a switch inside the assembler. The mode controlled by this 
switch is set to decimal by the above directive. 

When the assembler is started this mode is initialized to decimal. 
Thus, the DEC directive is not really necessary unless the mode has been 
changed to octal and it is desired to return it to decimal. 

6.16 OCT Interpret Integers as Oc tal 

OCT [ comments ] 
As noted in 6.15 above, this directive sets a mode within the assembler 
to interpret unterminated integers as octal. When the assembler is 
started this mode is initialized to decimal. Thus, the OCT directive 
must be used before unterminated octal integers can be written. 

6 . 17 RAD Set Special Relocation Radix 

RAD expression [comment] 
As explained in ^.7 it is possible in a limited way to have multiple- 
relocated symbols. This action is performed when the special relocation 
operator (r) is used. The value of a symbol preceded by (r) is multiplied 
by a constant called the radix of the special relocation. The loader is 
informed of this situation so that it can multiply the base address by this 
same constant before performing the relocation. Because the special 
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relocation was developed specifically to facilitate the assembly of string 
pointers (cf . k.7), this constant is initialized to 3* If it is desired 
to change its value, however, the RAD directive must be used. The value 
of the expression in the operand field sets the new value of the radix. 
It must be absolute, and the expression must have a value when it is 
first encountered. 

6.18 FRGT Forget Name of Symbol 



FRGT s 1 , s 2 , s , . . . [comment] 



where s . are previously defined symbols 
The use of FRGT prevents the symbol(s) named in its operand field from 
being listed or delivered to DDT. FRGT is especially useful in situations, 
for example, where symbols have been used in macro expansions or conditional 
assemblies. Frequently such symbols have meaning only at assembly time; 
they have no connection whatever with the program being assembled. When 
DDT is later used, however, memory locations sometimes are printed out 
in terms of these meaningless symbols. It is desirable to be able to • 
keep these symbols from being delivered to DDT. 

6.19 IDENT Program Identification 

symbol IDENT [comment] 
IDENT causes the symbol found in its label field to be delivered to DDT 
as a special identification record. DDT uses the IDENT name in conjunction 
with its treatment of local symbols: in the event of a name conflict 
between local symbols in two different subprograms, DDT resolves the 
ambiguity by allowing the user to concatenate the preceding IDENT name 
to" the symbol in question. . 

IDENT statements are otherwise useful for editing purposes. They 
are always listed on pass 2, usually on the teletype. 
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6.20 DELSYM Delete Output of Symbol Table and Defined Op-codes 

DELSYM [comment] 
DELSYM inhibits the symbol table and opcodes defined in the course of 
assembly from being output for later use by DDT. Its main purpose is to 
shorten the object code output from the assembler. This might be 
especially desirable for an absolute assembly which produces a paper tape 
which is to be filled into the machine. 

6.21 RELORG Assemble Relative with Absolute Origin 

RELORG expression [comment] 

On occasion it is desirable to assemble in the midst of otherwise normal 

program a batch of code which, although loaded into core in some position, 

is destined to run from another position in memory. (it will first 
have to be moved there in a block.) This is particularly useful when 

preparing program overlays. 

RELORG, like ORG, takes an absolute expression denoting some origin 
in memory. It has the following effects: 

(a) The current value of the location counter is saved, i.e. the 
value of the expression and in its place is put the absolute 
origin. This fact is not revealed to DDT, however; during 
loading the next instruction assembled will be placed in the 
next memory cell available as if nothing had happened. 

(b) The mode of assembly is switched to absolute without changing 
the object code format; it still looks like relocatable binary 
program to DDT. All symbols defined in terms of the location 
counter will be absolute. Rules for computing the relocation 
value of expressions are those for absolute assemblies. 

It is possible to restore normal relocatable assembly. (cf. 6.22, RETREL) . 
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Some examples of the use of RELORG follow: 

(1) A program begins with RELORG 300B and ends with END. The 
assembler's output represents an absolute program whose origin is 00300n 
but which can be loaded anywhere using DDT in the usual fashion. (it 
is, of course, necessary to move the program to location 00300o before 
executing it. ) 

(2) A program starts and continues normally as a relocatable program. 
Then there is a series of RELORGs and some RETRELs. The effect is as 
shown below: 



} 



RELORG .100 



RELORG 200 



RETREL 



RELORG 300 



END 



Normal relocatable program. 



Absolute program origined to 100 



Absolute program origined to 200 



Normal relocatable program 



Absolute program origined to 300 



6.22 RETREL Return to Relocatable Assembly 
RETREL [ comment ] 
This directive is used when it is desired to return to relocatable assembly 
after having done a RELORG. It is not necessary to use RETREL unless one 
desires more relocatable program. The use of RETREL is shown in 6.21. 
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The effects of RETKE& are 

(a) to restore the Location counter to what it would have been 
had the RELORG(s) never been used, and 

(b) to return the assembly to relocatable mode. 

6.23 FREEZE P reserve Symbols, Op-codes, and Macros 
FREEZE [comment] 
It is sometimes true when assembling various sub-programs that they share 
definitions of symbols, op-codes, and macros. It is possible to cause the 
assembler to take note of the current contents of its symbol and opcode 
tables and the currently defined macros and include them in future 
assemblies eliminating the need for including copies of "this information 
in every subprogram's source language. This greatly facilitates the 
editing of this information. 

When the FREEZE directive is used, the current table boundaries for 
symbols and opcodes and the storage area for macros is noted and saved away 
for later use. These tables may then continue to expand during the current 
assembly. (A separate sub-program may be used to make these definitions. 
It will then end with FREEZE; END.) The next assembly may then be started 
with the table boundaries returned to what they were when FREEZE was last 
executed. This is done by entering the assembler at its continue entry 
point, i.e. one types 

@, CONTINUE ARPAS. 
Note that when the assembler has been pre-loaded with symbols, opcodes 
and macros it cannot be released (i.e. one cannot use another, sub-system 
like DDT QED, etc.) without the loss of this information. 
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6.2^ NOEXT Do Not Create External Symbols 

Because of its subprogram capability, the assembler assumes auto- 
matically that symbols which are not defined in a given program are external 
and will be defined in another subprogram. It does not therefore list out 
the use of such symbols as errors. 

If a program is in fact a free-standing program, i.e. if it is 
supposed to be complete, then clearly symbols which are not defined are 
errors and should be so noted in assembly. The NOEXT directive simply 
prevents external symbols from being established; thus undefined symbols 
are noted as errors. The directive must be. used at the beginning of a 
program before instructions or data have been assembled. Its use affects 
the entire program. Its form is 
NOEXT [comment] 

6.25 LIST Turn Specified Listing Controls On 

6.26 NOLIST Turn Specified Listing Controls Off 

Most assemblers provide a means of listing a program during assembly, 

i.e. printing out such items as the location counter, binary code being 

assembled, source program statement, etc. The association of these items 

on one page is frequently of great help to programmers. Two directives, 

LIST and NOLIST, control this process. Their form is as follows: 

LIST I r 4- 1 

NOLIST/ 'W^--- £ comment l 

where the s. are from a set of special symbols having 

meaning only when used with these directives. 
There are many listing options for this assembler. A list of special 
mnemonic symbols used in conjunction with these two directives is given 
below. The symbols have special meaning only when used with LIST and 
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NOLIST. They may be used at any other time for any particular purpose. 
The special symbols are: 
Symbol Meaning 

1 Listing during pass 1. Listing format will be 
controlled by other parameters. 

2 Listing during pass 2. Listing format will be 
controlled by other parameters. 

LCT Listing of location counter value (see below) 

BIN Listing of binary object code or values (see below) 

SRC Listing of source language (see below) 

COM Listing of comments (see below) 

MC Listing of macro calls (see below) 

ME Listing of certain directives during macro 

expansions (EQU, NCHR, NARG, RPT, CRPT, ENDR, IF, 
ELSF, ELSE, ENDF, ENDM). 

EXT Listing of external symbols at end of assembly 

NUL Listing of null & duplicate symbols at end of 
assembly. 

As an example of the meanings of various symbols above , consider the line 
of code A21 STB OUTCHR SAVE POINTER. 
It might list as 
v 02157 ; ^36 00217. V A21 STB OUTCHR. C SAVE POINTER. 



LCT BIN SRC COM 

It is not necessary to include each symbol possible, but rather only those 
parameters for -which changes are desired. It is, in fact, not necessary 
to give any symbols. 

LIST is equivalent to LIST 2 
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When the assembler is started, it initializes itself in the following 
way: 

LIST LCT,BIN,SRC,COM,MC,EXT,NUL 
NOLIST 1,2,ME,SYT 
The actual format of the assembly listing is controlled by the current 
combination of parameter values. The parameters are independent items 
except for the parameters MC and ME. ' In this case it is more reasonable • 
to think of their combination. Thus: 
MC ME ' Effect 

List outer level macro calls only 

1 List all macro calls and code generated, but 

suppress listing of certain directives (see ME 
in table above). 

1 List no macro calls, but rather all code generated 

except for certain directives. 

1 1 List everything involved in macro expansions. 

Regardless of the list control parameters which have been given to 
the assembler, it can be made to begin listing at" any time in either pass 
simply by typing a single rubout (typing a second rubout in succession will 
abort the assembly). Listing having been started in this manner can be 
stopped by typing the letter S. 

6 . 27 PAGE Begin New Page on Assembly Listing 

PAGE [comment] 
This directive causes a page eject on the assembly listing medium 
unless a page eject has just been given. It is used to improve the 
appearance of the assembly listing. 

6 . 28 REM Type Out Remarks in Pass 2 

REM remark to be typed 
This directive, when encountered in pass 2, causes the contents of 
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its operand and comments fields to "be typed out either on the Teletype 
or whatever file has been designated as the output message device. This 
typeout occurs regardless of what listing modes are set. The directive 
may be used for a variety of purposes. It may inform the user of the 
progress of assembly. It may give him instructions on what to do next 
(this might be especially nice for complicated assemblies). It might 
announce the last date the source language was updated. Or, it might be 
used within complex macros to show which argument substrings have been 
created during expansion of a highly nested macro (this for debugging 
purposes). 
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7.0 Macros and Conditional Assembly 

Assemblers with good macro and conditional assembly capability can have 

surprising power. This assembler features such capability. In this section 

the facilities for dealing with macros and conditional assembly will be 

discussed. Many examples will be given. 

7*1 Introduction to Macros 

On the simplest level a macro name may be thought of as an abbreviation 

or shorthand notation for one or more assembly language statements. In 

this respect it is like an opcode. The opcode is the name of a binary 

machine command, and the macro name is the name of a sequence of assembly 

language statements. 

EXAMPLE 7-1. 

The $k0 has an instruction for skipping if the contents of a specified 

location are negative, but none for testing the accumulator. SKA (skip 

if memory and accumulator do not compare ones) will serve when used with 

a cell whose contents mask all but the sign bit. The meaning of SKA used 

in this way is "skip if A positive." Thus a programmer will write 

SKA =UB7 

BRU NEGCAS NEGATIVE CASE 

Programs, however, are more than likely to have a logical need for 

skipping if the accumulator is negative. In these situations the programmer 

must write 

SKA =4B7 

BRU *+2 

BRU POSCAS POSITIVE CASE 

Both of these situations are awkward in terms of assembly- language 
programming. 
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But we have, in effect, just developed simple conventions for doing 
the operations SKAP and SKAN (skip if accumulator positive or negative). 
Let these operations be defined as macros. 

SKAP MACRO 

SKA =1+B7 

ENDM 
SKAN MACRO 

SKA =^B7 

BRU *+2 

ENDM 

Now — more in keeping with the operations the programmer has in mind — 

he may write 

.A22 SKAN 

BRU POSCAS 



The advantages of being able to use SKAP or SKAN should be apparent. 
The amount of code written in the course of a program is reduced. This 
in itself tends to reduce errors. A greater advantage is that SKAP and 
SKAN are more indicative of the action that the programmer has in mind. 
Programs written in this way tend to be easier to read. Note, incidentally, 
as shown above that a label may be used in conjunction with a macro. Labels 
used in this way are usually treated like labels on instructions; they are 
assigned the current value of the location counter. This will be discussed 
in more detail later. 
7.2 Macro Definition 

Before discussing more complicated use of macros, some additional 
vocabulary should be established. A macro is an arbitrary sequence of 
assembly- language statements together with a symbolic name . During 
assembly it is held in an area of memory called text storage . Macros 
may be created or defined . To do this one must give (l) a name and 
(2) the sequence of statements comprising the macro. The name and the 
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beginning of the sequence of statements in a macro are designated by 
the use of the MACRO directive (see ex. 7-1 above). 

name MACRO 



ENDM 
The end of the sequence of statements in a macro is signalled by the 
ENDM directive. 

The reader should now refer to Figure 1. When the assembler en- 
counters a macro definition (i.e., when it sees a MACRO directive), switch 
B is thrown to position 1. The programmer's source language is merely 
copied into text storage; note in particular that the assembler does not 
do any processing during the definition of a macro. Switch B is put back 
to position when ENDM is encountered. 

It is possible that within a macro definition other definitions may 
be imbedded. The macro defining machinery counts the occurrences of the 
MACRO directive and matches them against the occurrences of ENDM. Switch 
B is placed back in position actually only when the ENDM matching the 
last MACRO is seen. Thus MACRO and ENDM constitute opening and closing 
brackets around a segment of source language. Structures like the 
following are possible: 
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SOURCE 
LANGUAGE 



Binary Machine 
Language 



ASSEMBLER 



7jC 







B 



SYMBOLIC 
ASSEMBLY 
LANGUAGE 

• * 



0/ . 1 



TEXT 
STORAGE 



B 






1 
1 




1 


1 



Effect 

normal assembly 

macro definition 

macro expansion 

macro definition during macro expansion 
(to "be explained in more detail later). 



Figure 1: Information Flow During Macro Processing 
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The utility of this structure will not be discussed here. Use of this 
feature of imbedded definitions should in fact "be kept to a minimum since 
the implementation of this assembler is such that it uses large amounts 
of text storage in this case. What is important, however, is an under- 
standing of when the various macros are defined. In particular, when 
namel is being defined, name2,3, etc. will not be defined; they are 
merely copied unchanged into text storage. Name2 will not be defined 
until namel is used . 
7»3 Macro Expansion 

The use of a macro name in the opcode field of a statement is referred 
to as a call . The assembler, upon recognizing a macro call, moves switch A 
to position 1 (again see Figure l) . Input to the assembler from the 
original source language ceases temporarily and comes instead from text 
storage. During this period the macro is said to be undergoing expansion . 



-* It should be noted that macros — like opcodes — may be redefined. 
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It is clear that a macro must first be defined "before it is called. 

An expanding macro may include other macro calls; and these, in 
turn, may call still others. In fact, macros may even call themselves 
(when this makes sense). This is called recursion . Examples of the 
recursive use of macros are given later. When within a macro expansion 
a new macro expansion begins, information about the progress of the current 
expansion is put away. Successive macro calls cause similar information 
to be saved. At the end of each expansion the information about each 
previous expansion is restored in inverse fashion. When the final 
expansion terminates, switch A is placed back in position 0. Input then 
resumes from the source language program. 
7.I4. Macro Arguments 

Now let us carry example 7-1 one step further. One might argue that 
the action of skipping is itself awkward. It might be preferable to write 
macros BRAP and BRAN (branch to specified location if contents of accumulator 
are positive or negative). How is one to do this? The location to which 
the branch should go is not known when the macro is defined; in fact, 
different locations will be used from call to call. The macro processor, 
therefore, must enable the programmer to provide some of the information 
for the macro expansion at call time . This is done by permitting dummy 
arguments in macro definitions to be replaced by arguments (i.e., arbitrary 
substrings) supplied at call time. Each dummy argument is referred to in 
the macro definition by a subscripted symbol. This symbol or dummy name 
is given in the operand field of the MACRO directive. 
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EXAMPLE 7-2 

Let us define the macro BRAP. 

BRAP MACRO DUM 
SKAN 

BRU DUM(l) 
ENDM 

When called by the statement BRAP POSCAS 

the macro will expand to give the statements 

SKA =^B7 
BRU *+2 
BRU POSCAS 

Note that BRAP was defined in terms of another macro SKAN (a matter 

of choice in this example). Also note that as defined, BRAP was intended 

to take only one argument. Other macros may use more than one argument. 

EXAMPLE 7-3 

The macro CBE (compare and "branch if equal) takes two arguments. 

The first argument is the location of a cell to "be compared for equality 

with the accumulator; the second is a "branch location in case of equality. 

The definition is 

CBE MACRO D 

SKE D(l) 

BRU ' *+2 

BRU D(2) 

ENDM 

When called by the statement 

CBE =21B,EQL0C 

the statements generated will be 

SKE =233 
BRU *+2 
BRU EQLOC 
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Note that arguments furnished at call time are separated by commas. 

It is possible to include "both commas and spaces in arguments by enclosing 

the arguments in parentheses; the macro processor strips off the outermost 

parentheses of any substring used in a call. For example in the call of 

the macro MUMBLE 

MIMBLE A, (B,C),(D E) 

we have 

B(l) = a 
d(2) = B,C 
D(3) = D E 

7.5 The Use of Dummy Arguments in Macro Definitions 

Before giving further examples of the use of macros, the various 
ways that dummy arguments may be used in macro definitions will be 
discussed. In general a dummy may be referred to by the symbolism 

dummy ( expre s s ion ) 
The only restriction on the expression above is that it must not contain 
other dummies or generated symbols (see 7«7)« Furthermore, for obvious 
reasons it must have a known value when the macro is called . 
More than one dummy may be referred to by the notation 

dummy (express ion, expression) 
In the case of the call 

MUMBLE A,B,C,D,E 
then 

D(3,5)= C,D,E 
But it is possible to have confusion in this situation. If we have the call 

MUMBLE A,B,C,(D,E),F 

*It should be noted that a macro call may deliver more arguments than are referred 
to in its definition, but the converse is not true. A dummy argument not supplied 
with an argument at call time is considered an error. 
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then 

DlM(3,5) a C,D,E,F 
But which are DIM(3), DUM(U), and DUM(5)? To resolve this ambiguity, the 
assembler produces in place of DUM(3,5) the string 

(C),(D,E),(F) 
The notation 

dummy () 
produces all of the arguments supplied in a macro call. Each is surrounded 
by parentheses as in the example above. 
The symbolism 

dummy (0) 
is legal and meaningful. It refers to the label field of the macro call. 
Normally a label used with a macro call is assigned the current value of 
the location counter (as with any instruction). Explicit use of dummy(O), 
i.e., literal zero in parentheses, causes the label field not to be 
handled in the normal way. It serves merely to transmit another argument. 
There are three possible cases. 

(1) Macro contains no references to dummy(O) . Label field is 
treated normally. 

(2) Macro contains at least one reference to dummy(O). Label field 
merely transmits an argument which replaces dummy(O) in the 
expansion. 

(3) Macro contains no references to dummy(o) explicitly but does 
contain dummy(expression) where, at call time, the value of the 
expression is zero. In this case the label field is handled as 
in case (l) and also used to transmit the argument referred to by 
dummy(expression) as in case (2). 
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The symbolism 

dummy(-l) 
is used to represent the terminal character of the opcode field, i. e., to 
determine whether the macro name terminated with a blank or a * (in case 
of indirect address). It allows macros to be called with or without 
"indirect addressing" specified. Thus in a typical call we have the 
following relationships: 

M17, CALL* ABC,DEF, ! GHI*,JKL 

T t ^f-Ou^ 

dummy(O) dummy(-l) dummy(l)Y dummy(3,^) 

dummy() 

Note that dummy (-1) is always one character long. 

Sometimes in a macro definition it is desirable to refer only to a 
portion of an argument, perhaps to a character or a few characters. In the 
case of a single character this may be done by writing 

dummy (expression$expression) 
The first expression designates which argument; the second determines 
which character of that argument. If a substring of an. argument is 
desired, one writes 

dummy (express ion$expression, expression) 
The second and third expressions determine the first and last characters 
of the substring. For example, if we have the call 

MUMBLE A, BCDE/FGHIJ 1 
then . . . 

DUM(2 $3) = D 

DUM(3 $^,7) = HU 1 
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Beginning with the ith character the latter part of an argument can be 
obtained by specifying an over large terminal bound. Thus 

DUM(2$1^ 1000) = HIJ 1 
7.6 Concatenation 

It is frequently useful to compose statements out of macro arguments 
(or parts of them) and other information given in the macro definition. 
This is done by concatenating the various objects together, i.e. simply 
writing them next to each other. It is possible to confuse the assembler 
when doing this, however. For example, let the dummy name in a definition 
be C, and suppose we wish to concatenate the strings AB and C(3)« If we 
write ABC(3), then do we mean AB concatenated with C(3)> A concatenated 
with BC( 3) (whatever that is), ABC(3), or what? 

To avoid ambiguity we use the character " . " (dot or period) as a 
concatenation delimiter. For. the example just above we would write 
AB.C(3)> and no ambiguity then exists. The assembler uses the dot to 
delineate objects it must deal with; in producing output the macro expansion 
machinery . after having recognized the various objects simply skips over 
the dots . The dot character cannot therefore be - used literally in a macro 
definition . 

EXAMPLE 7-b 

Let us define a macro STORE. Suppose we have established the 

convention that certain temporary storage cells begin with the letters 

A,B, or X, depending on from what 9^0 register information is to be stored 

there. The definition is 

STORE MACRO D 

ST.D(1$1).D(-1) D(l) 

ENDM 



If called by the statements 



STORE B17 
STORE* Xkk 
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the macro will expand as 

STB B17 or STX* Xkk 
The dot is not actually needed in every incidence of concatenation. 
Some programmers may readily determine for themselves when it is actually 
needed. As a matter of good practice, however, when in doubt, use it I 
7.7 Generated Symbols 

A macro should not, of course, have in its definition an instruction 
having a label. Successive calls of the macro would produce a multiply 
defined symbol. Sometimes, however, it is convenient to put a label on 
an instruction within a macro. There are at least two ways of doing this. 
The first involves transmitting the label as a macro argument when it is 
called. This is most reasonable in many cases; it is in fact often 
desirable so that the programmer can control the label being defined . 
and can refer to it elsewhere in the program. 

However, situations do arise in which the label is used purely for 
reasons local to the macro and will not be referred to elsewhere. In 
cases like this it is desirable to allow for the automatic creation of 
labels so that the programmer is freed from worrying about this task. 
This may be done by means of the generated symbol . 

A generated symbol name may be declared when a macro is defined. To 
do this requires two things; (l) the name and (2) the maximum number of 
generated symbols which will be encountered during an expansion. These 
two items may follow the dummy symbol name given in the MACRO directive. 
The actual format used is 

name MACRO dummyname,generatedname, expression 
For example, we might have 

MUMBLE MACRO D,G,^ 

• 

ENDM 
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In the definition of this macro there might "be references to 

G(l), G(2), G(3), and G(h), these "being individual generated symbols. 

With regard to generated symbols the macro expansion machinery 
operates in the following fashion. A generated symbol base value for each 
macro is initialized to zero at the beginning of assembly. As each 
generated symbol is encountered, the expression constituting its subscript 
is evaluated. This value is added to the base value, and the sum is pro- 
duced as a string of digits concatenated to the generated symbol name. 
Enough digits are produced to make the resultant symbol six characters 
long. Thus, the first time MUMBLE is called, for example, G(2) -will be 
transformed into G00002, G(*0 into GO0O0H, etc. 

At the end of a macro expansion, the generated symbol base value is 
incremented by the amount designated by the expression following the 
generated symbol name in the MACRO directive. (This was k in the 
definition of MUMBLE above.) Thus the second call of MUMBLE will produce 
in place of G(2), G00006, the third call will produce G00010, etc. It" 
should be clear that a generated symbol name should be kept as short as 
possible. It cannot be longer than 5 characters. 
7*8 Conversion of a Value to a Digit String 

As an adjunct to the automatic generation of symbols or for any other 
purposes for which it may be suitable a capability is provided in the 
assembler ! s macro expansion machinery for conversion of the value of an 
expression at call time to a string of decimal digits. The construct 

(^expression) 
will be replaced by a string of digits equal in value to the expression. 
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For example, let X = 5. Then 
AB.($2*X-l) 
will be transformed into 

AB9 
Further examples of the use of this facility appear below. 
7-9 The NARG and NCHR Directives 

Macros can be more useful if the number of arguments supplied at 
call time is not fixed. The precise meaning of a macro (and indeed, the 
results of its expansion) may depend on the number or the arrangement of 
its arguments. In order to permit this the macro undergoing expansion must 
be able to determine at call time the .number of arguments supplied. The 
NARG directive makes this possible. 

NARG functions basically like EQU, except that no expression is used 
with it. Its basic form is 

symbol NAEG [comment] 
The function of the directive is to equate the value of the symbol to the 
number of arguments supplied to the macro currently undergoing expansion. 
The symbol can then be used by itself or in expressions for any required 
purpose. Examples of the use of NARG appear later. 

It is also useful to be able to determine at call time the number of 

characters in an argument. NCHR functions by equating the symbol in its 

label field to the number of characters in its operand field. Its form is 

symbol NCHR characterstring [comment] 

The notion of "operand field" must be elaborated on here. The operand field 

normally terminates on the first blank after the beginning of the field. 

This rule is rescinded if a_ macro argument containing blanks appears in 

the operand field . For example, in the statement 

XYZ LDA VECTOR, 2 THIS IS A COMMENT 
t t 
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the arrows delineate the operand field. Alternatively, if a statement like 

TEXT X,D(l). ERROR 
is placed in a macro definition and the macro is called by 

MUMBLE (NON-FATAL ) 

then the above statement will turn out to be 

TEXT X,NON-FATAL ERROR 
t t 

Notice how the operand field terminates in this case. 

In the same example notice that the message produced by the text 

directive is of unspecified length at definition time. Clearly, X must 

depend on the number of characters in D(l). Accordingly, MUMBLE might be 

defined as 

EXAMPLE 7-5 

MUMBLE MACRO D 

X NCHR D(l) 

X EQU X+9 5 FOR ' ERROR', k TO ROUND UP 

TEXT X/^,D(l). ERROR 

ENDM 

7-10 Conditional Assembly 

The reader should see by now that the macro is a powerful tool. 
Its power, however, is considerably multiplied when combined with the 
features explained in this and the following sections. These features — 
basically the if and repeat capabilities — are called conditional 
assembly capabilities because they permit assembly-time calculations 
to determine the source language actually assembled. They are, however, 
not strictly a part of the macro facilities and may be used quite apart 
from macros. 
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7.11 The RPT Directive 

The RPT (repeat) directive is, like the MACRO directive, an opening 
"bracket for a segment of program. Its form is 

(1) [label] RPT expression [comment] 

or using s for symbol, e for expression, and c for comment 

(2) [label] RPT (s^,^ ]e^) [c] 

(3) [label] RPT ( s=e ^ t e 2 > l e 3 )( s=e 1 ^ e 2^ )( s=e 1 ^ e 2 l )" ' • ^ 
Form (l) says to repeat the following sequence of statements down to the 
matching ENDR (end repeat) as many times as given by the value of the 
expression. Forms (2) and (3) are really the same form; they are shown 
separately to emphasize that only the first parenthesized group in the 
operand field must be present. Their meaning is as follows: 

(1) Set the symbol s to the value of e.. 

(2) Issue the sequence of statements down to the matching ENDR. 

(3) Increment s by the value of e„ or by one (if e is not present). 

If the new value of s has not passed the limit, go back 
to (2). When the limit is passed, quit. 

In other words, for symbol=e 1 step e^ until e^ do ... 

or for symbol=e.. until e._ do . . . 
The first parenthesized group (l) determines the number of times the 
repeat is executed and (2) controls the initial value and increment of a 
symbol. Subsequent groups (there may be up to ten of them) merely control 
the initial value and increments of other symbols carried along in the 
recent operation. 
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EXAMPLE 7-6 

It is desired to create an area of storage which is cleared to zero. 
The BSS directive cannot be used for this purpose since its function (that 
of reserving storage) is basically to advance the assembler's location 
counter .• The problem is readily solved by 



ABC 


RPT 


100 




DATA 
ENDR 





which is equivalent to 






ABC 


DATA 






DATA 





DATA 





DATA 





DATA 






100 statements 
DATA V 

Note that the label is applied effectively only to the first statement. 
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EXAMPLE 7-7 

It is desired to fill an area of storage with data starting with 
and increasing by 5 for each cell. We may write 

X EQU 

RPT 20 

DATA X 

X EQU X-*5 

ENDR 

Alternatively (and more simply) one can write 

. RPT (X=0,5,100) 

DATA X 
ENDR 

Note that in the latter form the terminal value (i.e., e ) does not have 

to be positive or greater than the initial value of the symbol being 

incremented. 

RPT (X=100,-5,20) 

and RPT (X=INIT,-5,-30) 

are both permissible. 

Also note that a repeat directive followed by other statements and 
an associated ENDR (referred to as a repeat block ) may be imbedded in other 
repeat blocks. This is similar to the imbedding of macro definitions in 
other macro definitions, and repeat structures similar to that shown in 
section 7*2 may be used. 
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EXAMPLE 7-8 

It is desired to have a pair of macros SAVE and RESTOR for purposes 
of saving and restoring active registers at the beginning and end of 
subroutines. These macros should take a variable number of arguments 
so that one can write, for example, 

SAVE A,SUBRS 
or perhaps 

RESTOR A,B,X,SUBRS 
These calls are intended to generate the code 

STA SUBRSA 
and 



LDA 


SUBRSA 


LDB 


SUBRSB 


LDX 


SUBRSX 



We first define a generalized macro MOVE which is called by the same 

arguments delivered to SAVE and RESTOR plus the strings 'ST* and 'LD' 

which determine whether one wishes to store or load. 

MOVE MACRO D 
X NARG 

RPT (Y=2,X-l) 

d(i).d(y) d(x).d(y) 

ENDR 
ENDM 

Then, in terms of MOVE, SAVE and RESTOR are readily defined as 

SAVE MACRO D 

MOVE ST,D() 
ENDM 

RESTOR MACRO D 

MOVE LD,D() 
ENDM 
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EXAMPLE 7-9 

Many programs make use of flags, memory cells which are used as 
binary indicators. The SKN (skip if memory negative) makes it easy to 
test these flags. Let us adopt the convention that a flag is set if it 
contains the value -1 and reset if it contains zero. We want to develop 
the macros SET and RESET to manipulate flags. It is further desirable 
to deliver at call time the name of an active register which will be used 
for the action, together with a variable-length list of flag locations. 
Calls of these macros will look like 

SET A,FLG1,FLG2,FLG3 

or 

RESET X,FLG37,FLG12 

As in example 7-8 we make use of an intermediate macro STORE which 

takes the same arguments. 

STORE MACRO D 
X NARG 

RPT (Y=2,X) 

st.d(i) d(y) 

ENDR 
ENDM 

Thus SET and RESET are defined as 

SET 



RESET 



MACRO 


D 


LD.D(l) 


=-1 


STORE 


D() 


ENDM 




MACRO 


D 


CL.D(l) 




STORE 


D() 


ENDM ■ 
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7.12 CRPT, Conditional Repeat 

Occasionally one wishes to perform an indefinite number of repeats, 
termination coming on an obscure condition determined in the course of the 
repeat operation. The conditional repeat directive, CRPT, serves this 
function. Its effect is like that of RPT (and its repeat block — like 
RPT — is closed off by a matching ENDR) except that instead of giving 
a number of repeats its associated expression is evaluated each time in 
a Boolean sense to determine whether the repeat should occur again. Its 
form is 

[label] CRPT expression[, (s=e..[,e ] ), (s=e_[,e ]). . . ] 

[comment] 

One may write, for example, 
CRPT X>Y 
or CRPT STOP, (X=l,2)(Y=-3) 

Note that the statement 

CRPT 10 
will cause an infinite number of repeats. 

The termination of a CRPT operation is governed by whether the value 
of the expression is one or greater. Zero or negative quantities are 
taken to mean don't repeat (Boolean or false ). Values of one or greater 
mean do repeat (Boolean 1 or true ). 

An example of the use of CRPT is shown in example 7-H« 

7.13 IF Capability 

It is frequently desirable to permit the assembler either to assemble 
or merely skip blocks of statements depending on the value of an expression 
at assembly time. This is primarily what is meant by the term conditional 
assembly . Conditional assembly can be done (inelegantly) with CRPT. 
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Let the condition be given by an expression. (Once again a Boolean 
value is ascribed to an expression in the manner 

if e<0 

1 if e>0.) 
Then one may write 

EXAMPLE 7-10 

C EQU condition 
CRPT C 

\ arbitrary block of statements 

C EQU 
ENDR 

Note that the line before ENDR is required to prevent the CRPT from going 

forever. By using the structure above, however, conditional assembly may 

be done; the arbitrary block of statements enclosed in the repeat body 

may be assembled on condition. 

7 • 1^ IF , Assemble if Expression True (i.e., > 0) 

The same function shown in example 7-10 is performed much more 
conveniently by the IF directive. Its form is 

[label] IF expression [comment] 

ENDF 
As with RPT and CRPT, the IF directive defines the beginning of a block 
of statements (called the if body ) terminated by a matching ENDF. The 
if body may contain other if bodies. 

When doing conditional assembly there are often alternative if bodies 
to be assembled in case a certain if body does not assemble. This situation 
is most easily dealt with by the use of the ELSF and ELSE directives. 
These provide an end to the if body and also begin another body which is 
to be assembled (again possibly on condition) in case the first body did 
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not. For example, consider the following structure: 

IF e„ 



1 



body. 



ELSF e 2 



} 



body 2 



ELSF e 



} 



body 
ELSE 

Jr bod yi + 
F 

If e..>0, body is assembled and bodies ? ~ . are skipped (regardless of 

e and e . 

If e n <0 and eJ>0, body ? is assembled and bodies . are skipped. 

If e 1 and e <0 and e_>0, body, is assembled and bodies , are skipped. 

Finally if e.,, e^, and e_<0, body, is assembled. 

An example of the use of IF (and other features) follows. 
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EXAMPLE 7-10 

This example serves to illustrate several of the preceding features 
and also the power of macros used recursively. The macro MOVE is intended 
to take any number of pairs of arguments. The first argument of each pair 
is to be moved to the second. Each argument, however, may itself be a 
pair of arguments, which may themselves be pairs, etc. 

We first define MOVE. Basically it extracts pairs of argument 
structures and transmits such a pair to another macro M0VE1. 

D 



MOVE 


MACRO 


X 


NARG 




RPT 




M0VE1 




ENDR 




ENDM 



(Y=1,2,X}(Z=2,2) 
D(Y),D(z) 



We now define M0VE1. It calls itself recursively until it comes up 
with a single pair of arguments. Then it generates code. 



M0VE1 MACRO 


D,G,2 


G(l) NARG 




G(2) EQU 


t ^ 


IF 


Gfl)=2 


LDA 


D(l) 


STA 


D(2) 


ELSE 




RPT 


G(l)/2 


G(2) EQU 


G(2)+l 


U EQU 


0(1 
0(2) 


V EQU 


M0VE1 


D(v),D(V+U/2) 


ENDR 




ENDF 




ENDM 




Thus when called by the line 




MOVE 


A,B 


the code generated will be 




LDA 


A 


STA 


B 
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When called by 



the code generated is 



When called by 



the code generated is 



Finally when called by 
the code generated is 



MOVE A,B,C,D 



LDA 


A 


STA 


B 


LDA 


C 


STA 


D 



MOVE (A,B),(C,D) 



LDA 


A 


STA 


C 


LDA 


B 


STA 


D 



MOVE ((A,B), (D,D)), ((E,F), (G,H)) 



LDA 


A 


STA 


E 


LDA 


B 


STA 


F 


LDA 


C 


STA 


G 


LDA 


D 


STA 


H 



In this case the main call results in the call - 

M0VE1 (A,B),(C,D),(E,F),(G,H) 



M0VE1 calls itself by 
and again: 



M0VE1 A,B,E,F 



MOVEl A,E 

where the first code is generated. Then we get 

MOVEl B,F 
Recursion then pops up to the call 

. MOVEl C,D,G,H 
and so on. 
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EXAMPLE 7-11 

The following example makes use of virtually every feature in the macro 
and conditional assembly machinery. It is presented as a demonstration of 
the power inherent in the use of macros but not as a practical tool (critics 
have justly termed it the world 1 s slowest compiler). The macro COMPILE when 
called with an arithmetic expression for its argument produces assembly 
language which computes the value of the expression in a minimum number of 
steps (subject to the left-to-right scan technique used). COMPILE in turn 
calls a large number of other macros. Their functions are explained by comments 
in the text below: 

The COMPILE macro itself merely initialires some variables and calls 
EXPAND where the more difficult work is done. J is the total number of 
characters in the expression. K is used to keep track of the recursion level 
on which the work is being done (EXPAND calls itself recursively when it sees 
an opening bracket [ ) . AVAIL is the counter for available temporary storage . 
NPTR and PPTR are stack pointers for the operand and operator stacks respectively. 



COMPILE MACRO D;J NCHR DC 1 )'; K ECU 0;AVAIL EQU 1 ; NPTR EQU -1;PPTR EQU -1 
EXPAND D(l); ENDM 



EXPAND initializes I> the current character pointer. It places 
the value zero on the operator stack (marking its beginning on the current 
level) and fetches the first operand. It then sets a switch (G(l)) and goes 
into a cycle of fetching operators (GETP) and operands (GETN). If the 
precedence of new operators is less than or equal to that of the previous 
operators, code is generated. Otherwise the information is stacked and the 
scan continued. 
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EXPAND MACRO D , G , 1 ; I ECU 1;K EQU K+l; STACK 0,P; GETN DC 1 ) ; SET G( 1 ) 

CRPT GC1) 

IF I<J; GETP DC 1 SI) 

ELSE; OP TOR EQU 11; RESET GC1) 

ENDF • 

;PSTAK EQU PST.CSPPTR) 

CRPT OPTOR/1C<PSTAK/10+1; GEN DO) 

rF D OPTOR--ll;PPTR F.QU PPTR-1; RESET GO ) ; K EQU K-1;I EQU I.C$K) + I-1 
ELSE; STACK CPTCR,P 
. IF NPTR>0 

IF NST.(iMPTn-l)<0 T N 

IF NST.($NPTR-1)=-1; STA TEMP . ( SAVAI L) 
ELSE; RSH 1; STB TEMP . C $AVA 1 L) 
F ND F 
;NST.($MPTR-1) EQU A VAIL; AVAIL EQU AVAIL+1 

ENDF 
E^DF 

GETM DO SI, J) 
ENDF 
ENDR 
ENDM 



SET and RESET change the setting of flags. STACK is used to put values 
and pointers on "stacks." (These are not, of course, physical stacks in 
memory but rather conceptual ones existing in the assembler's symbol table.) 
STACK functions by creating an ordered progression of names and assigning 
values to the names by means of the EQU directive. 

SET MACRO D;DO) EQU 1; EMDM 

RESET MACRO D; DO ) EQU 0; ENDM 

STACK MACRO D; TS EQU DC 2) .PTR+1 ; DC 2) .PTR EQU TS; DC 2) .ST. C $TS) EQU DO) 
ENDM 
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GETN fetches. the next operand. Its complexity is due to the fact that 
it must recognize symbols (in this example using the assembler's symbol rules) 
and numbers. When this recognition is complete it puts in the operand stack 
a pair of pointers to the head and tail of the operand (i.e., character numbers 
in the string and a flag bit which denotes whether the object is a symbol or 
a number. Note that if an opening bracket is seen, GETN calls EXPAND recursively. 



GETM MACRO D; TO EQ U I; RESET ERROR; GETC D(1$I-TG+1) 
IF CHAR='[ '; I.($K) EQU I; EXPA ND DC 1 S2 , J) 
ELSE 

IF LETTER; RESET NUMBER 

ELSE; SET JUMPER 

ENDF 

I F DIGIT; SET SWITCH 

CRPT SWITCH; GETC D(1$I-T0+1) 
IF DIGIT 
ELSF LETTER; RESET SWITCH 

IF CHARr'B'; GETC DC1SI-T0+1) 
IF LETTER; RESET NUMBER 
ELSF DIGIT; RESET NUMBER 
' ENDF 
ELSE; RESET NUMBER 
ENDF 
ELSE; RESET SWITCH 
ENDF 
ENDR 
ELSF LETTER 
ELSE; SET ERR OP 
ENDF 

IF NUMBER 
ELSE; SET SWITCH 

CRPT SWITCH; GETC DC1SI-T0+1) 
IF LETTER 
ELSF DIGIT 
ELSE; RESET SWITCH 
ENDF 
ENDR 
ENDF 

IF ERROR; ERROR; STACK 0,M 
ELSE; STACK T0*1 E4+I-2+4F 3* NUMBER, N 
ENDF 
;I ECU 1-1 
ENDF 
ENDM 
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GETC's main function is to determine Whether a given character is a 
letter, digit, or other type of character. GETP fetches the next operator. 
It does some checking of the results and if valid sets OPTOR to a value 
carrying "both operator and precedence information. 



GETC IWCRO D;CHAR EQU 'DC 1 ) * ; I EOU I + 1;A EQU CHAR> , Z , ;E EQU CHAR<'A' 
IF AC0R)B;A ECU CHAR> , 9 , ;P EQU CHAR< , 0' 
IF ACCR)P; RESET LETTER; RESET DIGIT 
ELSE; SET DIGIT; RESET LETTER 
EN OF 
ELSE; SET LETTER; RESET DIGIT 
ENDF 
ENDM 

GETP MACRO D; GETC D( 1 ) 

IF LETTER(OR)DIGIT; FRROR 

ELSE; A ECU CHAR>11B6;E EQU CHAR<20B6 

IF A(AND)P; OPTOR ECU OPS . ( SCHAR/1 E6) 

ELSF CHAR= '] ';0PTCR ECU 11 

ELSE; OP TOR ECU -1 

ENDF 

IF OPTORr-1; ERROR; OPTOR ECU 40 

ENDF 
ENDF 
EM DM 



GEN and GENA serve to reconstruct the operands from the string pointers 
and call generators which actually produce code. 



GEN MACRO D; R EQU -1;PP2 EG U PST. ( SPPTR) ; PP3 EQU NST.C SNPTR-1 ) 
;PP4 EOU PP3/1P4;PP5 EQU PP3-?P4*1B4 

IF PP5>4E3;PP5 EQU PP5-4D3; SET LIT1; RESET LI T2 

ELSE; RESET LIT1; RESET LI T2 

ENDF 

IFPP3>1B4; GEMA DC 1 ) , DC 1 $PP4 ,PP5) 
ELSF PP3>0; GENA DC 1 ) , TEMP . C SPP3) ; A VAI L lCU PP3 

ELSFPP3=-1; GENA DC1),AREG 

ELSF PP3=-2; GENA D(1),BRE6 

ENDF 
;NPTR EQU NPTR-2; STACK R,N;PPTR EQU PPTR-1;PSTAK EOU PST.CSPPTR) 

ENDM 



R-26 
7-30 



GEMA MACRO D;PP5 EG U NST. C SNPTR) ; PP6 EQ U PP5/1E4 
;PP7 EQU PP5-PP6*1E4 

IF PP7>4B3;PP7 EQU PP7-4B3; SET LI T2 
ENDF 

IFPP5>1P4; GEN.CSPP2) DC 2) , DC 1 $PP6,PP7) 
ELSF PPf>>0; GEN.($PP2) D(2) f TEMP. ($PP5) ; A VAIL EQU PP5 
ELSF PP5=-1; GEN.CSPP2) D(2),ARFG 
ELSF PP5:-2; GEN.CSPP2) D(2),BREG 
ENDF 
E\!DM 



GEN20, 21, 30, 31 and ^4-0 are the code producing macros. They make 
reference to LIT1 and LIT2 (flags set by GEN and GENA.) and call macros 
TEST, LA, LB, and ST. The purpose of the latter macros is to worry about 
the meaning of the contents of the A and B registers so as not to inject 
superfluous code. 



GEN20 MACRO D; TEST DC 1 ) , D(2) ,X ; LA DCX) ,LI T. C SX) 
IF X=l 

IF LI T2; ADD =.D(2) 

ELSE; ADD D(2) 

ENDF 
ELS E 

IF LIT1; ADD =.D( 1) 

ELSE; ADD DC 1) 

ENDF 
ENDF 
ENDN 

GEN21 MACRO D; TEST DC2)',X 
IF X; LA DC2) ,LIT2 

IF LIT1; CNA; ADD r.D(l) 

ELSE; CNA; ADD DC1) 

ENDF 
ELSE; LA DC1),LIT1 

IF L1T2; SUE =.DC2) 

ELSE; SUB DC2) 

ENDF 
ENDF 
ENDM 
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GEN30 MACRO D; TEST D(1),D(2),X; LA D(X) ,LI T. ( $X) 
IF X = l 

IF LIT2; ; V ;UL =.D(2) 

ELSE; MUL DC 2) 

ENDF 
ELSE 

IF LIT1; MUL r.DC 1) 

ELSE; MUL D( 1) 

ENDF 
ENDF 

;R EQU -2 
ENDM 

GEN31 MACRO D; TEST D(2),X 

IF X; ST D(2$l); LP D(1),LIT1; DI V TEMP.(SAVAIL) 
ELSE; LB D(l) ,LIT1 

IF LIT2; DIV =.D(2) 
ELSE; DIV D(2) 
ENDF 
ENDF 
ENDM 

GEN40 MACRO D; NOP D(l); NOP D(2) 
ENDM 

LA MACRO D 

IF 'D(l) 'r'AREG' 

ELSF f D(l) '= 'BREG'; LSH 23 

ELSE 

IF D(2); LDA =.D'(1) 
ELSE; LDA D(l) 
ENDF 
ENDF 
ENDM 

LB MACRO D 

IF f D(l) 'z'BREG' 
ELSE 

IF , D(1) 'r'AREG' 
ELSE 

IF D(2); LDA = .D( 1) 
ELSE; LDA D(l) 
ENDF 
ENDF 
RSH 23 
ENDF 
ENDM 

ST MACRO D 

IF f D(l) '^'BREG'; RSH 1 
ENDF 
ST.D(1$1) TEMP. <$A VAIL) 
ENDM 
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TEST MACRO D;Y NARG; D(Y) EG U 
"rpt (Z = 1,Y-1) 

IF , D(ZSl,4)'= , ARECi , ;D(Y> EQUZ 
ELSF , D(Z$1,4) •r'EREG'-jDCY) EQUZ 
ENDF 
ENDR 
I F Y>2 

I F D(Y)=0; D(Y) EQU 1 
E m D F 
Ef-'DF 
ENDM 



The following lines establish precedence information for the arithmetic 
operators . 



0PS10 EQU 30;0PS11 EQ U 20;CPS12 EQ U -1;0PS13 EQ U 21;CPS14 EQ U -1 
OPS 15 EQU 31 



When called by the following lines, the macro generates code as shown: 



Call: 


COMPILE 


X- 


Re suit : 


LDA =200 




MUL Y 






ADD X 




Call: 


COMPILE 


Aj 


Result: 


LDA 


C 




ADD 


D 




STA 


TEMPI 




LDA 


E 




ADD 


F 




STA 


TEMP2 




LDA 


TEMPI 




RSH 


23 




DIV 


TEMP2 




CNA 






ADD 


AB 



X+200*Y 



AB-[C+D]/[E+F] 
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Call: COMPILE A+200#3 l *C21-[DEF/3 i +B-HI*[ J+20*K]/lM33B - N]/0PQ-22 

Result: 



LDA 


=200 


MUL 


3*+C21 


LSH 


23 


ADD 


A 


STA 


TEMPI 


LDA 


DEF 


RSH 


23 


DIV 


=3*+B 


STA 


TEMP2 


LDA 


=20 


MUL 


K 


LSH 


23 


ADD 


J 


MUL 


HI 


DIV 


LM33B 


CWA 




ADD 


TEMP2 


SUB 


N 


RSH 


23 


DIV 


OPQ 


CNA 




ADD 


TEMPI 


SUB 


=22 
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7 . 15 Sp ecial Symbols in Conditional Assembly 

Although in the introduction it is stated that symbols consist 
only of letters and digits, it is possible to include the colon in 
symbols. DDT, however, does not regard the colon as part of a symbol. 
The meaning of this is that DDT will type out such symbols but they 
cannot be typed in. In effect this makes them useless, and it is for 
this reason that the legality of colons in symbols has just now been 

mentioned. 

Yet by judiciously choosing when to use the colon in a symbol 
the feature can become worthwhile. In particular it can be used in 
macros and other obscure places in the program to avoid possible 
conflicts with other names. This might be particularly useful to 
distinguish between symbols used in assembly-time calculations and 
those used at run-time. 
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8.0 Assembler Error Messages 

Upon discovering an error in the syntax of a program being assembled, the 
assembler will list the statement in question and information about the 
character of the error. The listing of errors will occur regardless of whether 
regular listing is being done. 
8.1 Error Messages 

Error messages and their interpretations are given below. The first 
group deals with difficulties found in a single statement. 
Error Meaning 
D Duplicate symbol. 

L Error in label field; most likely not a valid symbol. 

M Missing field in statement. 

Invalid or undefined opcode. 

R Relocation error in expression. 

S General syntax error. 

U Undefined symbol. 

If when calling a macro the user fails to deliver an argument required 
during expansion, the assembler will replace the argument with the character t 
and issue an undefined symbol message at that point. 

The second group of error messages deal with more complicated difficulties 

Error Message Meaning 

SYMBOL TABLE FULL. ERROR Too many symbols and/or opcodes have 
CHECK CONTINUES. been defined. Assembly will continue, 

but no new symbols or opcodes will be 
recognized. Break the program into 
sub-programs or otherwise reduce the 
number of symbols present. 
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Error Message 

LITERAL TABLE FULL. FUR- 
THER LITERALS IGNORED- 

MUST ASSEMBLE ABSK2£ ON 
PAPER TAPE 



INPUT STACK OVERFLOW 



EOF — END CARD ASSUM2D 



Meaning 

Similar to the case above. Reduce the 
number of literals present. 

The bootstrap loader for self-filling, 
absolute assemblies is intended for paper 
tape only. Designating any other form on 
output file (except NOTHING and TELETYPE 
(another form of paper tape)) results in 
this message. It is possible to assemble 
an absolute program for loading by DDT. 
See 6.21 RELORG. 

There are too many nested macro calls, 
repeats, and ifs in combination. The stack 
provided for storing the previous source 
of input is full. This is a disaster. 
The program must be reorganized. 

No END statement was found at the end of 
the program. The assembler (except for 
typing this message) takes the same action 
as if it found the END statement. 



ILLEGAL COMMAND 



INPUT FILE NOT TEXT 



BAD CHAR 



EOF IN MACRO DEFINITION 



INPUT STACK UNDERFLOW. 



INPUT BUFFER FULL. 



The assembler does not recognize a command 
typed in by a user upon start-up. It 
makes him start again. 

The input file described to the assembler 
is not a type 3 file (i.e., text). 

An unrecognizable character (or one 
otherwise out of place) is found in the 
text. The character is typed out in octal 
following the message, replaced by a blank 
in the text, and assembly continues. 

The end of the program is reached, but the 
assembler is still defining a macro. Look 
for a missing ENDM. 

The opposite problem to the one above. Not 
terribly serious. Look for the presence of 
an extra ENDM, ENDR, or ENDF in the program, 

An input statement must be less than 320 
characters long. This message occurs when 
the rule is violated. It usually happens 
when macros run wild. Look carefully at 
the program near where the error occurred. 
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Error Message 

TOO MUCH MACRO RECURSION. 

TOO MUCH RPT RECURSION. 



Meaning 

Too many nested macro calls have occurred, 
resulting in filling available pushdown 
storage. Reorganize program. 

Similar to above. 



TOO MANY ARGS IN MACRO. 



TOO MANY REPEAT ARGS, 



STRING STORE EXCEEDED. 



EOF IN TEXT. 



The macro is being called with more 

arguments than there is space for. 

Reduce the number of arguments in the call. 

In beginning a repeat block> too many 
requests for automatic incrementing of 
symbols have been made. Reorganize the 
block. 

No space remains to store new macro 
definitions or to do repeats. Caution: 
old macro definitions are not thrown away. 
Do not redefine macros indiscriminately. 
Reorganize program. 

The end of the input file has occurred 
in the middle of a statement. 



8 . 2 Interpretation of the Error Listing 

When an error is listed on any file other than TELETYPE, the single- 
letter error message (first group above) is listed in the line below at 
the point where the error was detected. Other information is given. 
This is all depicted in the examples below. 

In the following line there are errors in the label and operand fields. 
00172 76 00000 UGHH LDA 



EEK+7 

7 

Current value of 
location counter is 
7 cells past the 
symbol EEK. 




Label cannot 
terminate with 

/• 



1 \ 



Relocation Expression 
error. cannot terminate 
with - . 
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20117 35 10761 



STA ZOT; 



M 



YIKES+1 



Location 

counter 

value. 



s 



MUMBLE 

T 



DOLT 



Missing tag 



Name of 
innermost macro 
in which offense 
occurred. 



\ 



Name of outermost 
macro in which 
offense occurred. 



Thus along with each error the location counter is printed out relative 
to the symbol most recently defined. In addition, if the error occurs 
during macro expansion the names of the innermost and outermost macros 
are printed to give a clue on where to look for the error. If only 
one level of macro expansion is involved, then only that name is listed. 
In order to save time when error listings are made on the teletype, 
the single-letter error messages are typed out at the left margin. 
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9. ASSEMBLER OPERATING INSTRUCTIONS 

ARPAS is called in the EXEC by typing 
. z ARPAS 

followed by depressing the return key on the teleprinter. The system 
responds with 

INPUT: 



requesting the user to type the file name of the symbolic file to be 
assembled. 

INPUT: /SYM/ 

/after typing his file name /SYM./ followed by a line feed, the system 
responds with BINARY: 

BINARY: /BIN/ 

The user types his selected file name, /BIN/, for storing the binary 
output of his assembly and again depresses the line feed key on his 
teleprinter. The system will respond with OLD FILE if the file name 
already exists in his file directory. Depressing the line feed key at 
this point will cause all existing information in this file to be replaced 
with the binary output from this assembly. Depressing AJt Mode or 
Escape will permit the selection of a new file name. "When the system 
types NEW FILE, typing a line feed will confirm the file name or typing 
an Alt Mode will permit the selection of a different file name. The 
teleprinter page appears as: . 

BINARY: /BIN/ 
OLD FILE 
or 

BINARY: /BIN/ 
NEW FILE 

If a carriage return is depressed after either OLD FILE or NEW FILE, 
the system responds with 

OK 

and pass one of the assembly begins. 

If a line feed is depressed after either OLD FILE or NEW FILE, an 
option is available to the user. 

TEXT OUTPUT:' TEL 
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If the option, TEXT OUTPUT, is selected, the user types TEL followed 
by a Carriage Return. The system responds with 

OK 

and pass one of the assembly begins. A program listing of the assembly 
will appear on the user's teletype. 

Typing a carriage return rather than TEL aborts the text output option 
and begins the assembly by typing 



ASSEMBLY EXECUTION 

If the text output option was not selected by the user, the system continually 
transmits non-printing characters to the user's teleprinter, giving him an 
audible indication the assembly is in process. At any time during the 
assembly, the user may type a single Alt Mode or Escape to activate listing. 
The listing will begin at the point in the program that is currently being 
assembled. It will continue to list on the teleprinter until the assembly is 
complete or the user types 



to stop the listing. This process may be repeated throughout the assembly 
process to determine how far the assembly has progressed. 

'When the assembly is complete, the number of cells used by the program 
is typed out as v/ell as a table of symbols by the program. For example: 

3453 CELLS USED BY PROGRAM 



BS 


N 45+ 


EBSM3 


N 


1466+ 




ENDBRS 


IN 3335+ 


SMB 


N 


0+ 




SRB 


N 13+ 


XSP 


N 21 + 




EXTERNAL SYMBOLS USED: 








ACTR 


ADMSX 


ARD A WD 




BPTEST 


.BRRL3 


BRSTV 


CARRY 


CBRF CET 




CHRL 


CIB 


CKBUF 


CLR.8P 


COB CPARW 


CPUPC 


CQO 


CRASH 


CRSW 
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APPENDIX A 








EXTENDED LIST OF INSTRUCTIONS 




Mnemonic 


Operation- Code 


Function 




Load/ St ore 








LDA 


76 


Load A 




STA 


35 


Store A 




LDB 


75 


Load B 




STB 


36 


Store B 




LDX 


71 


Load X 




STX 


37 


Store Index 




EAX 


77 


Copy effective address 


into index 


XMA 


62 


Exchange M and A 




Arithmetic 








ADD 


55 


Add M to A 




ADC 


57 


Add with carry- 




ADM 


63 


Add A to M 




MIN 


61 


Memory increment 




SUB 


5h 


Subtract M from A 




sue 


56 


Subtract with carry 




MUL 


61+ 


Multiply 




DIV 


65 


Divide 




Logical 








ETR 


11+ 


Extract (AND) 




MRG 


16 


Merge (OR) 




EOR 


17 


Exclusive or 




Register Change 








RCH 


1+6 


Register change 




CLA 


1+6 OOOOl 


Clear A 




CLB 


k6 00002 


Clear B 




CIAB 


1+6 00003 


Clear AB 




CLX 


2 1+6 00000 


Clear X 




CLEAR 


2 1+6 00003 


Clear A, B and X 




CAB 


1+6 00001+ 


Copy A into B 
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' Mnemonic 

CBA 
XAB 
BAC 
ABC 
CXA 
CAX 
XXA 
CBX 
CXB 
XXB 
STE 
LDE 
XEE 
CNA 
AXC 

Branch 
BRU 
BRX 
BRM 
BRR 
BRI 

Test/Skip 
SKS 
SKE 
SKG 
SKR 
SKM 
SKN 
SKA 
SKB 
SKD 



Operation Code 

1+6 OOOIO 

1+6 OOOlU 

1+6 00012 

1+6 00005 

1+6 00200 

o i+6 ooi+oo 

1+6 00600 

1+6 00020 

1+6 oooi+o 

1+6 00060 

1+6 00122 

1+6 0011+0 

1+6 00160 

1+6 01000 

1+6 001+01 



01 

1+1 

51 
11 



1+0 

50 

73 

.60 
70 

53 

72 
52 

7h 



Function 

Copy B into A 

Exchange A into B 

Copy B into A, Clearing B 

Copy A into B, Clearing A 

Copy X into A 

Copy A into X 

Exchange X and A 

Copy B into X 

Copy X into B 

Exchange X and B 

Store Exponent 

Load Exponent 

Exchange Exponents 

Copy negative into A 

Copy A to X, clear A 



Branch unconditionally 

Increment index and branch 

Mark place and branch 

Return branch 

Branch and return from interrupt 

Skip if signal not set 

Skip if A equals M 

Skip if A greater than M 

Reduce M, skip if negative 

Skip if A = M on B mask 

Skip if M negative 

Skip if M and A do not compare ones 

Skip if M and B do not compare ones 

Difference exponents and skip 
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Mnemonic 


Operation . Code 


Function 


Shift 






RSH 


66 OOxxx 


Right shift AB 


RCY 


66 20xxx 


Right cycle AB 


LRSH 


66 2kxxx. 


Logical right shift 


LSH 


67 OOxxx 


Left shift AB 


LCY 


67 20xxx 


Left cycle AB 


NOD 


67 lOxxx 


Normalize and decrement X 


Control 






HLT, ZRO 


00 


Halt 


NOP 


20 


No operation 


EXU 


23 


Execute 


Breakpoint Tests 






BFEx 


kO 20xx0 


Breakpoint test 


Overflow 






ROV 


22 00001 


Reset overflow 


REO 


22 00010 


Record exponent overflow 


OVT 


22 00101 


Overflow test and reset 


OTO 


22 00100 


Overflow test only 


Interrupt 




- 


EIR 


02 20002 


Enable interrupts 


DIR 


02 2000U 


Disable interrupts 


AIR 


02 20020 


Arm/disarm interrupts 


IET 


kO 20002 


Interrupt enabled test 


IDT 


1+0 20001+ 


Interrupt disabled test 


Channel Tests 






CATW 


1+0 11+000 


Channel W active test 


CETtf 


1+0 11000 


Channel W error test 


CZTW 


1+0 12000 


Channel W zero count test 


CITW 


1+0 10000 


Channel W inter-record te 


Input/Output 






EOD 


06 


Energize output D 
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Mnemonic 



Operation Code 



Input/Output (920 Compatible) 

MIW 12 

WIM 32 

PZN 33 ' 

POT 13 

EOM 02 

BETW kO 20010 

BRTW kO 21000 



Syspops 
BIO 
BRS 
CIO 
CTRL 
DBI 
DBO 
DWI 
DWO 
EXS 
FAD 
FDV 
FMP 
FSB 
GCD 
GCI 
ISC 
1ST 
LAS 
LDP 
LIO 
OST 
SAS 
SBRM 
SBRR 
SIC 
SKSE 

SKSG 



576 

573 
561 
572 

5*+3 
5hk 

5^5 
552 
556 
553 
55^ 
555 
537 
565 
5U1 
550 
5U6 
566 
552 
551 
5*+7 
570 
51* 
5*+0 
563 
562 



Function 

M. into W buffer when empty 
W buffer into M when full 
Parallel input 
Parallel output 
Energize output M 
W buffer error test 
W buffer ready test 

Block I/O 

Branch to system 

Character I/O 

Control 

Drum block input 

Drum block output 

Drum word input 

Drum word output 

Execute instruction in system mode 

Floating add 

Floating divide 

Floating multiply 

Floating subtract 

Get character and decrement 

Get character and increment 

Internal to string conversion (floating. 

output ; 

Input from specified teletype 

Load from secondary memory 

Load pointer (AB) 

Link I/O 

Output to specified teletype 

Store in secondary memory 

System BRM 

System BRR (prestored macro) 

String to internal conversion (floating 

input; 
Skip on string equal 

Skip on string greater 



Mnemonic 


Oper 


•at ion Code 


Function 


STI 




536 


Simulate teletype input 


STP 




567 


Store pointer 


TCI 




57*+ 


Teletype character input 


TCO 




575 


Teletype character output 


WCD 




535 


Write character and decrement 


WCH 




56U 


Write character 


WCI 




557 


Write character and increment 


WIO 




560 


Word I/O 



A-l 



APPENDIX B 
TABLE OF TRIMMED ASCII CODE FOR THE SDS 930* 
(NUMERIC ORDER) 
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SPACE 31 


9 


62 


R 




1 


HI 32 


i 


63 


S 


2 


It 


33 


5 


6k 


T 


3 


2 


3^ 


< 


65 


U 


k 


$ 


35 


= 


66 


V 


5 


* 


36 


> 


61 


W 


6 


& 


37 


9 


70 


X 


7 


i 


ko 


#>| 


71 


y 


10 


( 


kl 


A 


72 


z 


11 


.) 


1+2 


B 


73 


c 


12 


* 


k3 


C 


7^ 


a 


13 


+ 


kk 


D 


75 


3 


Ik 


* 


k5 


E 


76 


A< 




15 


- 


k6 


F 


77 


«- 


16 


• 


kl 


G 


Ikk 


EOT 


IT 


/ 


50 


H 


lk5 


WRU 


20 





51 


I 


lk6 


RU 


21 


l 


52 


J 


ikl 


BELL 


22 


2 


53 


K 


152 


LF 


23 


3 


5^ 


L 


155 


CR 


2k 


k 


55 


M 






25 


5 


56 


N 






26 


6 


57 









27 


7 


60 


P 






30 


8 


61 


Q 









*The Teletype characters enclosed in "boxes cannot "be handled "by 
ARPAS and are converted to blanks when present. 
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1.0 General 

DDT is the debugging system for the SDS 930 Time-Sharing System. It has 

facilities for symbolic reference to and typeout of memory locations and central 

registers. Furthermore, it permits the use of literals in the same manner as in 

the assembler. It can also insert breakpoints into programs, perform a trace, 

and search programs for specified words and specified effective addresses. There 

is a command to facilitate program patching. Finally, DDT can load both absolute 

and relocatable files in the format produced by the assembler. 

The system has a language for communication between DDT and its users. The 

basic components of this language are symbols , constants, and commands . 

1 . 1 Symbols 

A symbol is any string of letters, digits, and dots (.) containing at 

least one letter. (However, a digit string followed by B or D is interpreted 

as an octal or decimal number respectively). In symbols of more than six 

characters, only the first six are significant: thus, ALHIABET is equivalent 

to ALPHAB- All opcodes recognized by the assembler are built-in symbols, 

except for some i/o instructions. Other symbols are ;1, ;2, ;A, ;B, ;F, 

;L, ;M, ;Q, ;X, and dot. Their meanings are explained below. 

Every symbol may have a value. This value is a 2H-bit integer; for 

most symbols it will be either an address in memory or the octal encoding of 

an operation code. Examples: 

ABC 

AB12H 

12XYZ 

The following are not symbols: 

135B 
AB*CD 

Symbols may be introduced to DDT in two basically different ways: 

(A) They may be written out by the assembler and read in from 

the binary program file by DDT. 
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(b) They may be typed in and assigned values during debugging. 

It is possible for a symbol to be undefined. This may occur if a program 
is loaded which references an external symbol not defined in a previously 
loaded program. It may also occur if an undefined symbol is typed in an 
expression. In general, undefined symbols are legal input to DDT except 
when their values would be required immediately for the execution of a 
command. Thus, for example, the ;G (GO TO) command could not have an un- 
defined symbol as its argument. 

Undefined symbols may become defined in several ways. They may be 
defined as external in the assembler (i.e. with EXT, ENTRY, or $) and read 
by DDT as part of a binary program. Alternatively, they may be defined by 
one of the symbol definition commands available in DDT. When the definition 
occurs, the value of the symbol will be substituted in all the expressions 
in which the symbol has appeared. 

If DDT type [U] after typing out the contents of a register, it means 
that the register contains an undefined symbol. The register is closed at 
once so that its contents cannot be erroneously changed. 

The only restriction on this facility is that, as for ARPAS, the 
undefined symbol must be the only thing in the address field of the word 
in which it appears. Incorrect uses of undefined symbols will be detected 
by DDT and will result in the error comment (ll). 

DDT keeps track of references to undefined symbols by building a 
pointer chain through the address fields of the words referring to the symbol 
Thus, suppose that the symbol A is undefined and appears as follows 

51 LDA A 

52 STA A 

53 MRG A 

and nowhere else in the program. After loading, the entry for A in DDT's 
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symbol table will contain a flag indicating that it is undefined and a pointer 
to 3. The above locations will contain: 

51 LDA 

52 STA SI 

53 MRG S2 

When the symbol is defined, DDT goes through the pointer chain and fills in 
the value. It recognizes the end of the pointer chain by a address. 

From this description it should be obvious what will happen if the 
pointer chain is destroyed. A probable consequence is that a search down the 
pointer chain will not terminate. DDT does such searches whenever it prints 
an address. If the chain it is searching has more than 256 links, it will 
print the symbol followed by (u) and continue. Fixing up an undefined symbol 
pointer chain which has been clobbered is an exercise which we leave to 
the reader. 

1.2 Block Structure 

A limited facility called the block structure facility is provided to 
simplify the referencing of local symbols which are defined in more than one 
program. Note that DDT's block structure has only a tenuous connection 
with the block structure of ALGOL. The block structure of a program is 
organized in the following manner: every IDERT read by DDT as part of a 
binary program file begins a new block. Any local symbol known to DDT hae a 
block number associated with it; global symbols do not have a block number. 
Undefined symbols are always treated as global. 

The name of a block is the symbol in the label field of the IDENT. If 
two IDENTs with the same symbol are read, the message (AlitEADY DEFINED) is 
printed, and the local symbol tables from the two blocks will be merged. 
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Global symbols must be unique within an entire program and are recoil zed 
at all times. If a multiple definition is encountered, the latest one takes 
precedence. Local symbols are recognized according to the following rules: 

(l) At any given time one block is called the primary block. All local 

symbols associated with the primary block will be recognized. 
{?.) If a symbol is used which is neither global nor in the primary 
block, the entire symbol table is scanned for it. If it occurs 
in only one block, the symbol is recognized properly. If it occurs 
in more than one block, the error message (a) is printed. 
(3) A symbol may be explicitly qualified by writing: 
SYMA&SYMB 
SYMA must be the name of a block. SYMB is then referenced as though 
the block whose name is SYMA were primary. 
(U) When a register is opened (see section 2.1), the block to which 
the symbolic part of its location belongs becomes primary. Thus, 
NN&XYZ/ causes block NN to become primary; if ABC is a unique local 
symbol in block PQ, then ABC/ causes block PQ to become primary. 
1.3 Literals 

Literals have the same format and meaning in DDT as in the assembler, 
i.e. the two characters' =' signal the beginning of a literal, which is 
terminated by any of the characters which ordinarily terminate an expression. 
In contrast to the assembler, the expression in a DDT literal must be defined. 

The literal is looked up in the literal table. If it is found, the 
address which has been assigned to it is the value of the symbol. If it 
does not appear in the literal table, it is stored at the address which is 
the current value of ;F, and this address is taken as the value of the 
literal. ;F is increased by 1. For example, if the literal -1 does not 
already exist in the literal table and ;F is 10O0B, then LDA =-1 causes -1 
to be stored at 1000B, and is equivalent to LDA 1000B; the new value of ;F 
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is 1001B. Exception: In patch mode, literals are saved and not stored 
until the patch is completed since otherwise they would interfere with 
the patch. 

When DDT types out a symbol whose value is an address in the literal 
table, it will type out in the same format in which it would be input; that 
is, as = followed by the numeric value of the literal. 
1.1* Constants 

A constant is any string of digits, possibly followed by a B or D. The 
number represented by the string is evaluated, truncated to 2h bits and then 
used just like the value of a symbol. The radix for numbers is normally 8 
(octal), but may be changed arbitrarily by the commands described in section 
2.1* below. If a number is terminated by B or D, it is interpreted as octal 
or decimal respectively regardless of the current radix. Constants are 
always printed by DDT in the current radix. 

It is possible to enter numeric op codes by typing the number followed 
by an (£ sign. Thus 100<6 =lM*00000B if the current radix is decimal (100D=1UUb) 
1-5 Commands 

A command is an order typed to DDT which instructs it to do something. 
The commands are listed and their functions explained in the table Yjelow. 
1.6 Expressions 

An expression is a string of numbers or symbols connected with blanks, 

+, -, ;*, ;/, ;&, ;<, ;=, ;>, and ;$. These operators have the following 

significance: + addition 

subtraction 
* (integer) multiplication 
/ (integer) division 
& (AND) ") 

< (LSS) / 

= (EQL) > as in AKPAS 

> (GTR) [ 

i (OR) J 

Expressions are evaluated strictly left to right: all operators have the 

same precedence. Parentheses are not allowed. The first symbol or number 
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may be preceded by a minus sign. Blank acts like plus, except that the 
following operand is truncated to lU bits before being added to the 
accumulated value of the expression. The value of an expression is a 2^ -bit 
integer. An expression may be a single symbol or constant. 

Examples: IDA has the value 76OOOOO 

LDA 10 has the value 76OOOIO if the 

radix is octal 

LDA 10D has the value 7600012 

If SYM is a symbol with the value 1212, then 

SYM has the value 1212 

SYM 10 has the value 1222 

LDA SYM has the value 07601212 

If this last expression were put into a memory register and later 
executed by the program the effect would be to load the contents of SYM, 
register 1212, into the A register. 

When DDT types out expressions, two mode switches control the format of 
the output. Commands for setting these modes are described in section 2.U 
below. The C-S mode determines whether quantities will be printed as constants 
or as symbolic expressions. In the latter case, the opcode (if any) and the 
address will be put into symbolic form. If the first nine bits of the value 
are or 1, no opcode will be printed; in the latter case a negative integer 
will be printed. If the opcode is not recognizable as a symbol, it will be 
typed as a number followed by an @ sign. 

The R-V mode controls the format in which addresses are typed. DDT 
types addresses when asked to open the previous or the next register, when 
it reports the results of word and address searches, and on breakpoints. 
In relative mode, addresses are typed in symbolic form, i.e., as the largest 
defined symbol smaller than the address plus a constant if necessary. If 
the constant is bigger than 200 octal, or if the value of the symbol is 
less than the first location of the program, the entire address is typed « ri .s 
a constant. In absolute mode, addresses are always typed as constant. 
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1.7 The Open Register 

One other major ingredient of the DDT language is the open remoter . 
Certain commands cause a register to be "opened". This means that its 
contents are typed out (except in enter mode, for which see the \ command), 
followed by a tab. Any expression the user types will then be inserted into 
the open register in place of its current contents. After this insertion the 
register is closed at once. Note that the string LDA ABC= is a command, and 
does not cause LDA ABC to be entered into the current open register. The 
current location is given by the symbol " . " (dot) which always has as its 
value the address of the last register opened, whether or not it is still open 

Note : 

(1) Comma and star (for indirect addressing) may be used in expressions 
as they are used in the assembler; e.g. LDA* 0,2 has the value 

27 6^0000. 

(2) DDT will respond to any illegal input with the character ? followed 
by a tab (if a register is open) or carriage return (otherwise), 
after which it will behave as if nothing had been typed since the 
last tab or carriage return. The command ? also erases everything 
typed since the last tab or carriage return. 

1.8 Memory Allocation and DDT 

DDT may cause the time-sharing system to assign memory for use either 
by DDT itself or by the user's program. DDT's memory is used to hold the 
symbol table, which starts in block and grows upward in memory. The 
symbol table contracts at the end of each load of a binary file and when 
symbols are killed; this contraction may cause memory to be released. 

DDT grabs program memory when it is required for loading a binary file, 
or when a ;U (execute) command is given and the value of ;F is such that 
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a new block is needed to hold the instruction to be executed. For executing 
an instruction, DDT requires location ;F, ;F+1 and ;F+2. Memory is never 
grabbed for examination of a register; however, entering information with \ 
can cause memory to be assigned. Attempts to open locations not assigned 
will cause DDT to type ?. This means that upon initial entry to DDT no 
registers are available for examination. The easiest way to obtain memory 
is to simply start typing in a program using the \ command. 

If an attempt to acquire or reference memory leads to a trap, DDT types 
(M) and abandons whatever it is doing. This can happen if the machine size 
is exceeded, or if an attempt is made to change read-only memory. 
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2.0 DDT Commands 

In the following descriptions of DDT commands, <S> will be used to denote 
an arbitrary symbol. <E> or <W> will be used to denote an arbitrary expression 
which may be typed by the user: <E> will be used when the value of this expression 
is truncated to ik bits before it is used by DDT, while <W> will denote a full 
2U-bit expression. <A> will be used to denote an optional lH-bit expression. 
If none is typed, the last expression printed out will usually be used; deviations 
from this rule will be described under the individual commands. <F> will denote 
a file name followed by a dot: DDT will type a tab whenever it expects a file 
name . 

2.1 Register Opening Commands 
<A> / This opens the register addressed by the value of <A>. DDT will give 
a tab, type an expression whose value is equal to the contents of the 
register, give another tab and await further commands. The precise form of 
the expression typed is dependent on the setting of the S-C and B-V modes. 
If the user types in an expression, DDT will insert its value into the 
register. Typing another command closes the register, unless it is a type 
value or symbol definition command. Note that in a command that requires 
a preceding expression, the expression is regarded as part of the command 
and would not, for instance, be inserted into the open register. If another 
/ is given as the next command with no preceding expression the contents 
of the register addressed by the expression typed by DDT are typed out. A 
further / repeats this process. Note, however, that the original register 
opened remains the open register; any changes made will go into that register, 

carriage This command does not necessarily have any effect. If the specified 
return 

conditions are present, however, any of the following actions may occur: 
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(l) If there is an open register, the register is closed. 

(?) If DDT is in enter mode, it leaves it. 

(3) If DDT is in patch mode, the patch is terminated (for a 

fuller description of this effect, see the patch command). 

<A> ] This command has the same effect as /, except that the contents of 

the register opened are always typed in symbolic form. 
<A> [ This command has the same effect as /, except that the contents of 

the register opened are typed in constant form. 
<A> $ This command has the same effect as /, except that the contents of 

the register opened are typed as a signed integer. 
<E> " This command acts like /, except that the register constants are typed 

in ASCII. Unprintable characters, as in QJED, are preceded "by &, e.g. lHl 

(control-A) prints out as &A. 

line This command opens the register whose address is the current location 
feed 

plus one, iJe. the register after the one just opened. The output of DDT on 

this command is carriage return, register address (format controlled by 

the R-V mode), /, tab, value of contents, tab. 

;o(^=space) This is equivalent to line feed except that nothing is printed. 

Its main use is in entering programs or data, e.g. 

1000 l;o2;^3 (carriage return) 

is equivalent to 

10O0\ 1 (carriage return) 

1001\ 2 (carriage return) • 

1002\ 3 (carriage return) 

t This command opens the register whose address is the current location 

minus one, i.e. the previous register. The output is the same as for the 

line feed command. 



Example : 










ABC/ 


LDA 


ALPHA 




(line feed) 


ABC+l/ 


STA 


BETA 


STA GAMMA 


(line feed) 


ABC +2/ 


LDB 


DELTA 


t 




ABC+l/ 


STA 


GAMMA 
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( This command opens the register whose address is the last ik bits 

of the value of the last expression typed. The output is the 3arne as for 
line feed. 

\ This command is the same as /, except that the contents of the register 
are not typed. DDT goes into enter mode, in which the contents of registers 
opened by line feed, t, or ( are not typed. Any other command caused DDT 
to go out of enter mode. In particular, carriage return has this effect. 
When a register has been opened with \, DDT thinks that it has typed out 
the contents. The type value commands will, therefore, work on the contents 
of the register. 

The type register in special mode characters [, ], $ (type as a negative 
integer), " (type in ASCII) are also preserved by line feed, up arrow and (. 

;\ This command suppresses typeout of register addresses during line feed, 
up arrow and ( chains. Carriage return cancels the command. 

2.2 Type Value Commands 
= This command types the value of the last expression typed (;Q) in 

constant form. It may appear in the form <W> =, in which case the value 

of the <W> is typed. Otherwise, the expression referred to is the one most 

recently typed, either by DDT or by the user. 
$ ' This command types the value of ;Q as a signed integer. 

«- This command types the value of ;Q in symbolic form. 
' This command types the value of ;Q typed as a word of text (see " command 

on previous page). 
@ This command types the address part of ;Q in symbolic form. If, for 

instance, the program has executed BRM X, then X\@ will cause DDT to print 

the address of the BRM. 
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Example : 




LDA= 


7600000 


LDA 10= 


7600010 


LDA <- 


LDA 


7600000*- 


LDA 


-1= 


77777777 


-1# 


-1 


77777777# 


-1 


102210^3' 


ABC 



2.3 Symbol Definition Commands 
<S> : This command defines the value of the symbol < S> to be the 

current location. If <S> has been used but is undefined, it becomes global; 

otherwise it becomes local and associated with the block which is primary 

when the : command is given. 
<S> & ?> This command defines the value of <S> to be the address of the last 

expression typed by DDT or the user. The symbol is local and associated with 

the block which is primary when the @ command is given. 
\<S>y This defines <S> to have the value of <E>, and to be global. 

2.U Mode Changing Commands 
" This command is followed by a string of arbitrary characters terminated 

by D (control D). If a register is open, the string will be inserted into 

successive locations packed 3 characters per word; otherwise characters 

beyond the third will be thrown away. Tor example, if no register is open, 

"ABCDED C = yields 102210^3- 
;D (DECIMAL) This command changes the current radix (see section l.k). 
;0 (OCTAL) This changes the current radix to octal. 
<E> ;R (RADIX) sets the current radix to the value of the expression, which must 

be >2. 
;[ (CONSTANT) This command changes the S-C mode to constant, i.e. makes / 

equivalent to [ . 
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', ] (SYMBOLIC) This command changes the S-C mode to symbolic, i.e. 

makes / equivalent to ] . 
;" (ASCII) This makes / equivalent to ". 
;$ (SIGNED INTEGER) This makes / equivalent to $. 
;F (RELATIVE) This command changes the R-V mode to relative. This 

mode determines the format for the output of addresses, both in symbolic 

expression and when generated by line feed and t. 
;V (ABSOLUTE) This command changes the R-V mode to absolute. 

2-5 Breakpoint Commands 

<A>,<E>! (BREAKPOINT) <E>! sets breakpoint at the address given by the value 
of the expression; <N>, <E>! sets breakpoint N (N must be between and 3 
inclusive). The effect is that if the program executes the instruction at 
this address control returns to DDT, which will print the address and the 
contents of the A, B and X registers and await further commands (see below). 
The break occurs before execution of the instruction in the breakpoint 
location. ;L is set to the location at which the break occurred. 

! (CLEAR ALL BREAKPOINTS). ! alone causes all breakpoints to be cleared 

<A>;! (LIST OR CLEAR BREAKPOINTS) 

<N>;! causes breakpoint N to be removed, where N lies between and 3 

inclusive. ; '. alone causes all breakpoints to be listed: if breakpoint 1 
is set at ABC+3* and no other breakpoints are set, then ;! produces the 
printout * ABC +3 * * . 

<A>;P (PROCEED) This command restarts the program after a break. The 

program executes the instruction at the break and goes on from there. No 
breakpoint is removed unless this is specifically done by ! or ; 1 so that, 
if the program arrives at this location again, another break will occur. 
If <E>;P is given, another break will not occur until some breakpoint has 
been reached that many times. 
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<A>;N (NEXT) This command executes the instruction at ;L and breaks. 

This program provides a trace facility in that repeated executions of ;N 
will provide a running print out of the contents of the significant internal 
registers, instruction by instruction. The function is essentially the same 
as that of the step switch on the console. <E>;N will cause <E> instructions 
to be executed before the next break occurs. 

The ;N command follows the flow of control in the user's program. In 
particular, it will normally trace the execution of users' POPs (see ;0 
below). The execution of SYSPOPs, however, is not traced. In other words, 
a SYSPOP such as FAD (floating add) is regarded as one instruction by ;N. 
Cells ;F, ;F+1, and ;F+2 are used by ;N and ;P. 

<E>;S (STEP). This is equivalent to <E> repetitions of ;N. Note that this 
is not the same as <E>;N. 

<E>;V (ADVANCE). This is equivalent to <E> repetitions of <P, and is not 
the same as <E>;P. 

<N>;0 (POP TRACE MODE). If <K»0, programmed operators (POPs) together 

with their associated subroutines will be treated like machine instructions 
for the ;N and ;S commands, i.e. the break will not occur until control 
returns to the location following the POP. Since DDT determines when it 
should break by counting POPs, BRMs, SBRMs, BRRs and SBRRs, it can be 
fooled by POPs which do sufficiently peculiar things. If <N><0, POP 
subroutines will be traced, i.e. the first break after the POP will be at 
the first instruction of the subroutine. 

<N>;U (SUBROUTINE TRACE MODE). If <N>=1, BRMs or SBRMs together with the 
subroutine called will be treated as single instructions by ;N. The same 
algorithm is used as in ;0 to determine when to break. If <N>=0, subroutines 
will be traced explicitly. 
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Attempts to proceed through certain instructions having to do with 
forks will produce erroneous results, and breakpoints encountered when 
the program is running in a fork will not do the right thing. Attempts 
to proceed through unreasonable instructions will cause the error comment 

$ > > . 

2.6 Input/Output Commands 

<A>;Y<E> DDT expects to find a binary program on the file <F>. If thc- 

program is absolute it is read in. If it is relocatable it is read in and 
relocated at the location specified by <A>. If the expression is omitted, 
relocatable loading commences at location 2U0B and continues by beginning 
each program in the first available location after the preceding one. 
After reading is complete, the first location not used by the program is 
typed out. Any local symbols on the binary file are ignored. 

<A>;T<F> This command is identical to ;Y except that is also reads local 

symbols from the file and adds them to DDT's symbol table. Any symbols on 
the file will be recognized by DDT thereafter. 

The following two points should be noted in connection with ;Y and ;T 
commands. 

1) The use of an expression before ;T or ;Y when the file is 
absolute (i.e. SAVE file or self-loading paper tape) is in error. 

2) The block read in becomes the primary block. 

;W<E> Causes all global symbols to be written on the specified file, in a 

format which can be read back in with ;T. 
;C<E> Causes all symbols to be written on the specified file. 

2.7 Search Commands 
<W>;W (WORD SEAECH) <W>;W searches memory between the limits ;1 and ;2 for 

cells whose contents match <V£> when both are masked by the value of ;M. 
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The locations an/5 contents of all such cells are typed out. 
<W>*# (NOT-WORD SEARCH). This is the same as ;W, except that all registers 
which do not match <W> will be printed. This is useful, for example, in 
finding and printing all non-zero registers in a given part of memory. 
<E>*E (EPTECTIVE ADDRESS SEARCH). <E>;E searches memory between the limits 

•1 and ;2 for effective addresses equal to <E>. Indexing, if specified, is 
done with the value of ;X. Indirect address chains are followed to a depth 
of 6*+. The addresses and contents of all words found are typed out. When 
•W or *E is complete, . is left pointing to the last register where the 
expression was found. 

2.8 The Patch Command 
<A> ) <A> ) causes a patch to be inserted. If a register is open and an 
expression is given, the expression is entered into the register. If a 
register is open, or if no expression is typed, the patch is made at . 
Otherwise, the patch is made at <A>. DDT inserts in this location a 
branch to the current value of ;F. When the patch is done, ;F is updated. 
It then gives a carriage return and a ) and waits for the user to type 
in the patch. Legal input consists of a series of expressions whose 
values are inserted in successive locations in memory. Each of these 
expressions should be terminated by line feed or ; t>, exactly as though the 
program were being typed in with the \ command instead of as a patch. The t 
command may be given in place of the line feed and has its usual meaning, 
except that the contents of the previous location are not typed. Two 
other commands are legal in patch mode. They are: 

(l) Colon, which may be used to define a local symbol with value 
equal to the current location. 



30. Ho. 10 

2-9 

December 2, 1966 

(2) Carriage return, which terminates the patch. When the 
patch is terminated, DDT inserts in the next available 
location the original contents of the location at which the 
patch was inserted. It then inserts in the following two 
locations branch instructions to the first and second locations 
following the patch. This means that if the patch command is a 
skip instruction, the program will continue to operate correctly. 
Any other command given in patch mode may cause unpredictable 
errors. 
<A>;I Is identical to the ) command except that it puts the instruction 

being patched before the new code inserted by the programmer instead of after 

2.9 Miscellaneous Commands 
;? and ? This commands erase everything typed since the last tab or carriage 

return. It is always legal. 
<E>;G (GO TO) <E>;G restores the A, B and X registers which were saved when 

DDT was entered (unless they have been modified) and transfers to the 

location specified by the value of the expression. 
;K (KILL) This command resets DDT's symbol table to its initial state. 

DDT will type back — OK and wait for a confirming dot. Any other character 

will abort the command. 
<S>;K (KILL). Removes only the symbol <S> from the table. 
<E>,<E>;L Sets ;1 and ;2 (the lower and upper brounds for searches) to the 

values of the first and second expressions respectively. 
;U (UNDEFINED). This command causes all undefined symbols to be listed. 
<E>;U (EXECUTE). This causes the value of the expression to be executed as 

an instruction. If it is a branch, control goes to the location branched to. 

In all other cases control remains with DDT. A single carriage return is 

typed before execution of the instruction. If the instruction does not 
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branch and does not skip, or returns to the following location, a $ and 
another carriage return are typed after its execution. If the instruction 
does skip, two dollar signs ($$) are typed followed by a carriage return. 

;Z (ZERO) <E>,<E>;Z sets to zero all locations between the value of the 
first expression and that of the second. ;Z alone releases all memory 
accessible to the user's program. DDT will type back — OK and wait for a 
confirming dot. Any other characters will abort the command. If this 
memory is returned, due to later access by DDT or a program, it will be 
cleared to zero. 

$& (LIST BLOCKS). The names of all blocks are printed. 

2.10 Special Symbols 

The value of "." is the current location, i.e. the address of the 
last register opened. 

The following symbols refer to various special registers of the machine 
Their value is the contents of these registers as saved by DDT: ;X= will 
print the saved contents of the X register. To change the contents of a 
register, a command of the form <E>;A is used. This command sets the A 
register to the value of the expression. Whenever DDT executes any command 
involving execution of instructions in the user's program, it restores the 
values of all machine registers. If any of these values have been changed 
by the user, it is the changed value which will be restored. 

;A The value of this symbol is the contents of the A register. 

;B The value of this symbol is the contents of the B register. 

;X The value of this symbol is the contents of the X register. 

;L The value of this symbol is the contents of the program counter. 

The only reason for changing ;L is to set the location from which ;N will 
begin execution. 
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The values of the following special symbols are used by DDT in certain 

commands or are available to the programmer for his general enlightenment. 

These values may be changed in the same way that the values of the symbols 

for the central registers of the machine may be changed. 
;M The value of this symbol is the mask for word searches. 
;1 The value of this symbol is the lower bound for word and effective" 

address searches. It may also be set by the ;L command. 
;? The value of this symbol is the upper bound for word and effective 

address searches. It may also be set by using ;L. 
;Q This symbol has a value equal to the value of the last expression 

typed by DDT or the user. It is useful, for instance, if the programmer 

wishes to add one to the contents of the open register; he need only type 

;Q + i. 

;F The value of this symbol is the address of the lowest location in core 
not used by the program. New literals and patches are inserted starting 
at this address. Note: like all other special symbols, ;F may be changed 
by the command <E>;F. It is also updated as necessary by patches and 
literal definitions. 

2.11 Panics 

DDT recognizes four kinds of panic conditions: 

(1) Illegal instruction panics from the user's program. 

(2) Memory allocation exceeded panics from the user's program. 

(3) Panics generated by pushing the rubout button. 

(k) Panics generated by the execution of BRS 10 in the user's program. 

For the first two of these conditions DDT prints out a message, the 
location of the instruction at which the panic occurred, and the contents of 
this location. The messages are as follows: 
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(1) Illegal instruction panic I > > 

(2) Memory allocation exceeded M > > 

(3) The other two types of panics cause DDT to type bell and 
carriage return. ;L and . will "both he equal to the location 
at which the panic occurred. 

If a memory allocation exceeded panic is caused by a transfer to an 
illegal location, the contents of the location causing the panic is not 
available and DDT, therefore, types a ?. 

Two other panic conditions, are possible in DDT. 

(l) If the rubout button is pushed twice with no intervening typing 

by the user, control returns to the executive. 
(?) If the rubout button is pushed while DDT is executing a command, 

execution and typeout are terminated and DDT types carriage return 
and bell and then awaits further commands . 

2.12 Multiple Program Debugging 

It is occasionally desirable to hold several programs with different 

maps and. symbol tables in DDT simultaneously. This situation could be 

approximated using the DUMP and RECOVER commands in the time-sharing 

executive, but several commands are provided in DDT itself to facilitate 

the process. 
<W >,<W^>;R (SET MAP). The pseudo-relabeling for the program is set according 

to the value of <W > and <WJ>. This command is essentially equivalent to 

executing BRS kk with <WJ> in A and <Wj> in B. 
$E (ERASE). DDT types — OK and waits for a confirming dot. Any other 
character will abort the command. DDT then resets itself to its initial 

state, i.e. the symbol table, program map, breakpoints and modes are all 

reset. The program memory, however, is not released. 
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%D (DUMP). This command also requires a confirming dot. The 

entire state of DDT is saved away and a number typed out which will allow 
this state to be retrieved by the $R command (see below). DDT then 
resets itself as described under $E above. 

<E>#R (RECOVER). This command requires a confirming dot. If the present 
state of DDT has ever been dumped (i.e. was produced by $R), it is dumped 
again. Then the state is restored exactly as it was when the %D was given, 
whose number was the value of <E>. Using an illegal number for ^R can 
lead to chaos. 



DDT COMMAND 
SUMMARY 



DDT SUMMARY CDDT1, 3-30-68) 

6- (THERE ARE 245 LINES IN THIS FILE) 

KEY: 

E SYMBOLIC OR NUMERIC EXPRESSION 

S SYMBOLIC EXPRESSION 

N NUMERIC 

(N) GROUP NUMBER 

R OPEN REGISTER 

APPLIES TO OPERAND 

GROUPS: 

(1) COMMANDS CONCERNING RADIX 

(2) COMMANDS TO EVALUATE EXPRESSIONS 
C3) OPENING REGISTERS 

(4) COMMANDS CONCERNING MODES 

(5) CLOSING REGISTERS AND INDIRECT ADDRESSING 

(6) CENTRAL REGISTERS 

(7) SPECIAL REGISTERS 

(8) SYMBOL DEFINITION 

(9) SYMBOL CONTROL 

( 10) WORD SEARCH 

(11) PROGRAM ALTERATION 

(12) PROGRAM EXECUTION AND TRACING 

(13) I/O 

(14) ARITHMETIC 
/' ( 15) LOGICAL 

~V (16) DDT STATE & RELABELLING 

^ (17) MISCELANEOUS 

(18) STRING PROCESSING 

(1) RADIX 

n5r sets radix to n 

;d sets radix to 10 

;o sets radix to s 

NB TAKES N AS BINARY (OCTAL) 

ND TAKES N AS DECIMAL 

(2) EVALUATE EXPRESSIONS 

(THE E BELOW MAY ALSO BE R OR OPEN REGISTER) 
• 

E= TYPES VALUE OF E AS AN UNSIGNED INTEGER 

E# TYPES VALUE OF E AS A SIGMED INTEGER 

E- TYPES E IN SYMBOLIC 

E« TYPES E AS TEXT (3 OR 4 CHARS.; SEE(4)) 

RO § TYPES OPERAND IN SYMBOLIC 

E;~ TYPES E AS SYMBOLIC STRING POINTER (SEE (18)) 

E; ' ASSUMES E IS ADDRESS OF A PAIR OF STRING POINTERS; TYPES STRING 



(3) OPEN IMG REGISTERS 



c 



E/ 
t 

LF 

; SPACE 

E C 

E 3 

E $ 

E M 

E\ 



; 3 

; c 

;$ 



;r 

; v 



OPENS 
OPENS 
OPENS 
SAME 
OPENS 
OPENS 
OPENS 
OPENS 
OPENS 



LOCATION 

PREVIOUS 

FOLLOWIN 

AS LF 

LOCATION 

LOCATION 

LOCATION 

LOCATION 

LOCATION 



E AND TYPES ITS CONTENTS IN THE CURRENT MODE 
LOCATION IN SAME MODE USED FOR CURRENT REGISTLn 
G LOCATION IN SAME MODE USED FOR CURRENT REGISTER 



AND TYPES ITS CONTENTS AS AN UNSIGNED INTEGER 

AND TYPES ITS CONTENTS IN SYMBOLIC 

AND TYPES ITS CONTENTS AS A SIGNED INTEGER 

AND TYPES ITS CONTENTS AS TEXT (3 OR 4 CHARS.) 
WITHOUT TYPING CONTENTS 



(ONLY OPEN REGISTER COM- THAT APPLIES TO NEW MEMORY) 



C4) MODES (SETS THE CURRENT MODE AS INDICATED) 
(4.1) REGISTERS 



SYMBOLIC (SIGNED INTEGER IF MORE THAN 200 FROM SYMBOLIC LOC. ) 

(MODE USED WHEN ENTERING DDT) 
UNSIGNED INTEGER 
SIGNED INTEGER 
ASCII (TEXT) (3 OR 4 CHARS. PER WORD (SEE (4.3)) 



(4.2) LABELS 



LABELS 
LABELS 



TYPED 
TYPED 



IN 
AS 



RELATIVE 
INTEGER 



SYMBOLIC (MODE UPON ENTERING DDT) 



;3 
;4 



(4.3) TEXT 

THREE CHARACTERS PER WORD 

FOUR CHARACTERS PER WORD (DOES NOT APPLY TO i % 



c 



COMMAND) 



(5) REGISTER CLOSING AND INDIRECT ADDRESSING 

(CONTENTS OF OPEN REGISTER PLACED BACK IN MEMORY) 

RO ( OPENS LOCATION (OPERAND) OF THE LAST EXPRESSION TYPED OR 

OF THE OPEN REGISTER (WHETHER TYPED OR NOT) 
t CLOSES CURRENT REGISTER AND OPENS PREVIOUS 

LF CLOSES CURRENT REGISTER AND OPENS FOLLOWING 

CR CLOSES CURRENT REGISTER 



(6) central registers 

;a(2) types the conttens of 

e;a stores the value of e 

;b(2) types the contents of 

e;b stores the value of e 

;x(2) types the contents of 

e;x stores the value of e 

;l(2) types the contents of 

EjL . STORES THE VALUE OF E 
(7) SPECIAL REGISTERS 



THE A REGISTER IN MODE OF COMMAND (2) 

IN THE A REGISTER 

THE B REGISTER 

IN THE B REGISTER 

THE X REGISTER 

IN THE X REGISTER 

THE LOCATION COUNTER 

IN THE LOCATION COUNTER 



;F(2) TYPES THE CONTENTS OF THE FIRST AVAILABLE MEMORY REGISTER 
E;F STORES THE VALUE OF E AS THE FIRST AVAILABLE MEMORY 

%0(2) TYPES THE CONTENTS OF THE ORIGIN REGISTER (FIRST USED MEMORY) 
S%0 STORE THE VALUE OF E AS THE CURRENT ORIGIN 

(OPERANDS AND LABELS BELOW ORIGIN ARE TYPED AS INTEGERS) 



s;k 


KILLS 




IS 


;k 


KILLS 


;u 


LISTS 


%& 


TYPES 


E&5K 


KILLS 



(8) SYMBOL OR LABEL DEFINITION 

E<S> ASSIGNS THE VALUE OF E AS THE VALUE OF THE SYMBOL S 
S: ASSIGNS THE CURRENT LOCATION (.) AS THE VALUE OF THE SYMBOL 

RO S@ ASSIGNS THE OPERAND OF THE OPEN REGISTER AS THE VALUE OF THE 
SYMBOL 

(9) symbol control 

symbol s (if s is undefined, the undefined queue 

left in memory but the symbol is killed) 

all symbols (type cr after --ok) 

all undefined symbols 

block i dents. 

all local symbols in block 

(10) word search 

ejm set the mask (used by ; w) to the value of e 

em sets the lower bound for a search to the value of e 

e;2 sets the upper bound to the value of e 

e1,e2;l sets the lower bound to el and the upper bound the e2 

e;w searches memory between lower and upper bounds for locations 

which match e when both are masked by the value of ;m 
e;e search memory between lower and upper bounds for effective 

address equal to e 
e;# same as ;w except that all words not matching are typed 
£' . ;# types all words that are not zero 

r ' \ ;e TYFEL all WORDS THAT have operands equal to zero 

(11) PROGRAM ALTERATION (PATCHES) 

E) CAUSES INSTRUCTIONS TO BE INSERTED BEFORE LOCATION E 

) CAUSES INSTRUCTIONS TO BE INSERTED BEFORE CURRENT LOC- (.) 

E>I CAUSES INSTRUCTIONS TO BE INSERTED AFTER LOCATION E 

;i CAUSES INSTRUCTIONS TO BE INSERTED AFTER CURRENT LOCATION 

(INSTRUCTIONS PATCHED, PATCHES, AND LITERALS GO AT 

LOCATION ;F WHICH MOVES % F) 

E1,E2;Z CLEARS LOCATIONS BETWEEN El AND E2 



r\ 



(12) PROGRAM EXECUTION 



(j m;ei 

! 

; ! 

n; ! 
e;g 

E%G 

;p 
n;p 

;n 

n;n 

n;s 

n; v 

o;o 

1 ;o 

o;u 

• 1 ;u 



ADDRESS E 

N CAN BE 0-3) 



TO THE ADDRESS E 



SETS BREAKPOINT TO THE 

SETS BREAKPOINT N (WHERE 

CLEARS ALL BREAKPOINTS 

LISTS ALL BREAKPOINTS 

CLEARS BREAKPOINT N 

STARTS EXECUTION AT LOCATION E 

REPLACES ;A WITH CURRENT INPUT FILE (COMMANDS -FROM OR TTY) 

AND THEN STARTS EXECUTION AT LOCATION E 
RESTARTS EXECUTION AT THE VALUE OF THE LOCATION COUNTER 
RESTARTS EXECUTION AT ;L AND BREAKS AFTER N BREAKPOINTS 

HAVE BEEN REACHED 
EXECUTES THE NEXT INSTRUCTION AND THEN BREAKS 
EXECUTES N INSTRUCTIONS AND BREAKS 

EXECUTES THE NEXT INSTRUCTION, BREAKS, AND REPEATS THIS 
SEQUENCE N TIMES 

RESTARTS EXECUTION AT ;L AND THEN BREAKS AT EACH BREAKPOINT 
FOR N BREAKPOINTS 

CAUSES POP'S TO BE TREATED AS ONE INSTRUCTION FOR ;N & ;S 
TRACES ALL POP'S 

CAUSES BRM'S AND SBRM »S TO BE TREATED AS ONE INSTRUCTION 
TRACES ALL BRM'S AND SBRM'S 



c 



( 13) 



INPUT/OUTPUT 



;T /FILE/ LOADS BINARY FILE AND SYMBOL TABLE AT LOCATION ;F 

(ALSO LOADS SAVE OR GO TO TYPE FILES) 
E;T /FILE/ LOADS BINARY FILE AND SYMBOL TABLE AT LOCATION E 
;Y LOADS BINARY FILE AND EXTERNAL SYMBOLS AT LOCATION ;F 

EJY LOADS BINARY FILE AND EXTERNAL SYMBOLS AT LOCATION E 

;W /FILE/ CAUSES ALL GLOBAL SYMBOLS TO BE WRITTEN ON THE SPECIFIED FILE 
;C /FILE/ CAUSES ALL SYMBOLS TO BE WRITTEN ON THE SPECIFIED FILE 



(14) ARITHMETIC (FOR EXPRESSIONS) 



E1+E2 PERFORMS INTEGER ADDITION 

E1-E2 SUBTRACTS E2 FROM El 

E1J/E2 DIVIDES El BY. E2 

Ei;*E2 MULTIPLIES El BY E2 

El E2 (SPACE) SAME AS + 

E1j:N TAKES El MOD N (DIVIDES El 



BY N AND TAKES REMAINDER) 



(15) LOGICAL (FOR EXPRESSIONS) 



(RESULTS ARE 1 FOR TRUE AND FOR FALSE) 



Ei;<E2 MAKES COMPARISON 

Ei;>E2 MAKES COMPARISON 

Ei;=E2 MAKES COMPARISON 

El;%E2 PERFORMS LOGICAL 

Ei;&E2 PERFORMS LOGICAL 

Ei;.E2 PERFORMS LOGICAL 



EKE2 

E1>E2 

E1=E2 

"OR" (MERGES El AND E2) 

"AND" (EXTRACTS El AND E2) 

EXCLUSIVE OR (DOES MACHINE INST. EOR) 



(16) ddt state and relabelling 

e1,e2;r sets program relabelling registers to el and e2 

;z releases program memory and resets ddt to original state 

;k kills all symbols 

%e resets ddt to original state without releasing program memory 

%d saves current ddt state by relabelling out symbol table 

(types n for the following) 
n%r restores ddt to state saved by %d command 

(17) miscelaneous 

;@ stops typing of carriage returns & reduces tab from 3 to 1 space 

(used when ddt running under co mm and- from programs) 
% ,, string ,, commands- from comment mode, copies all characters from the 

input file to the output file until the second " 
? ignores last typed instruction or string 

* INDIRECT (USED AFTER AN OP CODE, VALUE OF 4O0O0B) 

CURRENT LOCATION 
> SEPARATOR 

%F RETURN TO THE EXEC 

"ASC DtC X ENTERS ASCII WORD OF 3 OR 4 CHARACTERS DEPENDING UPON MODE 

WHERE X IS A COMMAND FROM GROUPS ( 2) , ( 6) * ( 1 0), ETC 

(DtC) IS CONTROL D 

n§ operation code definition 

e;u execute instruction (instruction stored at ;f) 

%v types version no. 

(18) STRING 

e;- types e as a SYMBOLIC STRING POINTER in the form 

ei;+n WHERE El is the symbol OF E/3 and n is the remainder 

E; • ASSUMES E IS ADDRESS OF A PAIR OF STRING POINTERS; TYPES STRING 
EJ+N MAKES A STRING POINTER BY TAKING E *.3 + N 
E;-N MAKES A STRING POINTER BY TAKING E * 3 - N 



